TITLE: METHODS FOR PERFORMING DAF DATA FILTERING AND PADDING



BACKGROUND OF THE INVENTION 

1. Field of the Invention

The present invention relates to methods using distributed approximating
functionals (DAFs), DAF-wavelets and DAF-neural networks for filtering, denoising,
processing, restoring, enhancing, padding, or other signal processing procedures directed
to images, signals, 1D, 2D, 3D ... nD spectra, X-ray spectra, CAT scans, MRI scans,
NMR, and other applications that require data processing at or near the theoretical limit
of resolution.

More particularly, the present invention relates to the use of infinitely smooth
DAFs in combination with other signal processing techniques to provide methods, and
apparatuses utilizing such methods, that can enhance image, spectral, or other signal data
and decrease the time needed to acquire images, spectra or other signals.

2. Description of the Related Art 

Many techniques currently exist for processing images, refining spectra,
analyzing data or the like. Many of these techniques are well-known and used
extensively. However, these techniques generally suffer from one or more limitations on
their ability to enhance a signal or image and to construct or restore missing or lost data,
especially if the user desires the error inherent in signal acquisition and the error introduced
by the processing technique to be as small as possible, i.e., as close as possible to
the limit set by Heisenberg's uncertainty principle.

Thus, there is a need in the art for improved techniques for processing acquired
data, whether in the form of an image, a spectrum, a multidimensional spectrum or the like,
so that the error due to processing can be minimized, which can increase resolution and
decrease acquisition times.

SUMMARY OF THE INVENTION

The present invention provides a method implemented on a digital processing
device or stored in a memory readable by a digital processing unit which uses distributed
approximating functionals (DAFs) to enhance and improve signal, image and
multi-dimensional data construct processing and to decrease the acquisition time of real world
spectrometric techniques that operate on square wave type frequency domains, which
require a large acquisition time to capture the signal from noise. The shorter the acquisition
time, the more noise and the less resolution and definition the spectra will have. The limit
for simultaneously acquiring frequency information in time is given by a variant of
Heisenberg's uncertainty principle, i.e., $\Delta\omega\,\Delta t \le 1$. The present invention provides
methods for image and signal processing where the accuracy and precision of the final
signal and image closely approach the uncertainty principle's maximum accuracy and
precision. The methods can be made to approach uncertainty principle accuracy via
increased computational cost, but the real power of the methods of this invention is to
yield improved measurements at a given $\Delta\omega$ and $\Delta t$ product.

The present invention also provides methods for improving X-ray and magnetic
imaging techniques, especially mammogram images, using the DAF and DAF processing
techniques set forth herein.

The present invention also provides a mammogram imaging system of
conventional design, in which the X-ray data derived therefrom is then enhanced via DAF
processing in an associated digital processing unit.

DESCRIPTION OF THE DRAWINGS

The invention can be better understood with reference to the following detailed
description together with the appended illustrative drawings in which like elements are
numbered the same. The figures are heading dependent. Each major heading, set forth in
all caps, bold and underlined text, has its own figures, with the figure numbering
restarting at 1. Each major heading also restarts equation numbering.

DAF TREATMENT OF NOISY SIGNALS

Figure 1 depicts the Hermite DAF in (a) coordinate space and (b) frequency space,
respectively. The solid line is for $M=88,~\sigma=3.05$ and the dashed line is for
$M=12,~\sigma=4$. The solid line is close to the interpolation region and the dashed line is
in the low pass filtering region. The frequency in (b) has been multiplied by a factor of the
grid spacing.

Figure 2 depicts extrapolation results for the function in Eq. (15). The solid line is the
exact function. The long dashed line, the short dashed line and the plus symbols are the
extrapolation results for Hermite DAF parameters $\sigma/\Delta=7.6$, $7.8$, and $8.0$, respectively.
In our numerical application, only the values at the grid points less than -1.2 are assumed
to be known.

Figure 3 depicts the sine function in Eq. (17) with 50% random noise added to the values at
evenly spaced grid points from 0 to 219 (solid line) and the periodically extended 36 function
values (plus symbols) with $\sigma/\Delta=10.5$. The exact values in the extended domain are also
plotted (solid line) in this figure.

Figure 4 depicts (a) the $L_\infty$ error and (b) the signature of the periodic padding of the noisy
sine function as a function of the DAF parameter $\sigma/\Delta$. $M$ is fixed at 6.

Figure 5(a) depicts the $L_\infty$ error and Figure 5(b) the signature of the DAF smoothing of
the periodically extended noisy sine function as a function of $\sigma/\Delta$. $M$ is fixed at 12.

Figure 6 depicts the periodic extension of the nonperiodic function (with noise added) in
Eq. (15). (a) The 220 known values of the function in the range [-7, 10] with 20% random
noise (solid line) and the 36 exact values of the function (dashed line). Note that the
function is not periodic at all in the range of 256 grid points. (b) The periodically
extended function with $\sigma/\Delta=5.2$. Note the smoothness and periodic property of the
function.

Figure 7 depicts the periodic extension signature of the noisy function in Figure 5 as a
function of $\sigma/\Delta$. $M$ is fixed at 6.

Figure 8(a) depicts the $L_\infty$ error and Figure 8(b) the signature of the DAF smoothing of
the periodically extended noisy function in Figure 6(b) as a function of $\sigma/\Delta$. $M$ is
fixed at 12 for the DAF smoothing.

Figure 9 depicts a comparison of the DAF smoothed result for the signal in Figure 6(b) at
$\sigma/\Delta=9$ (solid line) with the exact function (dashed line) in Eq. (15).

GENERALIZED SYMMETRIC INTERPOLATING WAVELETS

Figure 1. π band-limited interpolating wavelets (a) Sinc function and (b) Sinclet wavelet

Figure 2. Interpolating Cardinal Spline (D=5)

Figure 3. Interpolating wavelets by auto-correlation shell (D=3) (a) Daubechies wavelet
(b) Dubuc wavelet

Figure 4. Lifting scheme

Figure 5. Lagrange Wavelets with D=3 (a) Scaling function (b) Wavelet (c) Dual scaling
function (d) Dual wavelet

Figure 6. Frequency Response of Equivalent Filters (D=3) (a) Decomposition (b)
Reconstruction

Figure 7. Non-regularized Lagrange Wavelets (M=5) (a) Lagrange polynomial (b)
Scaling function (c) Wavelet (d) Dual scaling function (e) Dual wavelet

Figure 8. B-Spline Lagrange DAF Wavelets (N=4, h=2) (a) Scaling function (b)
Wavelet (c) Dual scaling function (d) Dual wavelet

Figure 9. Frequency Response of Equivalent Filters (N=4, h=2) (a) Decomposition (b)
Reconstruction

Figure 10. Mother wavelet comparison (N=4, h=2) Solid: B-spline Lagrange; dotted:
Gaussian-Lagrange

Figure 11. Gibbs overshoot of the Sinc FIR

Figure 12. Sinc Cutoff Wavelets (M=9) (a) Scaling (b) Wavelet (c) Dual scaling (d)
Dual wavelet

Figure 13. B-Spline Lagrange DAF Wavelets (N=5, h=3) (a) Scaling (b) Wavelet (c)
Dual scaling (d) Dual wavelet

Figure 14. Frequency Response of Equivalent Filters (M=5, h=3) (a) Decomposition (b)
Reconstruction

Figure 15. Mother Wavelet Comparison (N=4, h=2) Solid: B-spline Sinc; dotted:
Gaussian Sinc

Figure 16. Dubuc wavelets (D=3) (a) Inner scaling (b) Boundary scaling

Figure 17. Generalized Sinc wavelets (M=2, h=2) (a) Inner scaling (b) Boundary scaling

Figure 18. Frequency response comparison of boundary filters (a) Halfband Lagrange
wavelet (b) Sinc-DAF wavelet

Figure 19. Target extraction from color background (a) Original pilot view (b)
DAF-wavelet restoration

Figure 20. Visual Color Image Restoration (a) Noisy girl (b) Our restoration

Figure 21. Enhancement of Database 1 (a) Original mammogram (b) Linear
enhancement (c) Non-linear enhancement

Figure 22. Enhancement of Database 2 (a) Original mammogram (b) Enhancement

DUAL WINDOW SELECTIVE AVERAGING FILTER

Figure 1 depicts the filtering of a blocked signal corrupted by Gaussian noise: (a)
noise-free blocks signal; (b) noisy signal (MSE=1.00, MAE=0.80); and (c) filtered signal
by DWSAF (MSE=6.19E-2, MAE=6.49E-3).

Figure 2 depicts image restoration from lower noise Lena image: (a) corrupted image
(PSNR=22.17 dB) and (b) restored image by DWSAF (PSNR=30.69 dB).

Figure 3 depicts image restoration from higher noise Lena image: (a) corrupted image
(PSNR=18.82 dB) and (b) restored image by DWSAF (PSNR=28.96 dB).

LAGRANGE WAVELETS FOR SIGNAL PROCESSING

Figure 1 depicts π band-limited interpolating wavelets (a) Sinc function (b) Sinclet
wavelet

Figure 2 depicts Interpolating Cardinal Spline (D=5)

Figure 3 depicts Interpolating wavelets by auto-correlation shell (D=3) (a) Daubechies
wavelet (b) Dubuc wavelet

Figure 4 depicts Lifting scheme

Figure 5 depicts Lagrange Wavelets with D=3 (a) Scaling (b) Wavelet (c) Dual scaling
(d) Dual wavelet

Figure 6 depicts Frequency Response of Equivalent Filters (D=3) (a) Decomposition (b)
Reconstruction

Figure 7 depicts Lagrange Wavelets with D=9 (a) Scaling (b) Wavelet (c) Dual scaling
(d) Dual wavelet

WO 99/46731 PCT/US99/05426

Figure 8 depicts Frequency Response of Equivalent Filters (D=9) (a) Decomposition (b)
Reconstruction

Figure 9 depicts Non-regularized Lagrange Wavelets (M=5) (a) Lagrange polynomial
(b) Scaling (c) Wavelet (d) Dual scaling (e) Dual wavelet

Figure 10 depicts B-Spline Lagrange DAF Wavelets (N=4, h=2) (a) Scaling (b)
Wavelet (c) Dual scaling (d) Dual wavelet

Figure 11 depicts Frequency Response of Equivalent Filters (M=4, h=2) (a)
Decomposition (b) Reconstruction

Figure 12 depicts Mother Wavelet Comparison (N=4, h=2) Solid: B-spline Lagrange;
dotted: Gaussian Lagrange

Figure 13 depicts Nonlinear Masking Functionals (a) Donoho Hard Logic Nonlinearity
(b) Softer Logic Nonlinearity

Figure 14 depicts 2D Lagrange wavelets for image processing (a) Scaling, (b) Vertical,
(c) Horizontal and (d) Diagonal wavelets

Figure 15 depicts VGN image processing for Lena (a) Noisy Lena (b) Median filtering
result (c) Our method

Figure 16 depicts VGN processing for Barbara (a) Noisy Barbara (b) Median filtering
result (c) Our method

IMAGE ENHANCEMENT NORMALIZATION

Figure 1(a) Original Mammogram and Figure 1(b) Enhanced result

Figure 2(a) Original Mammogram and Figure 2(b) Enhanced result

VARYING WEIGHT TRIMMED MEAN FILTER FOR THE RESTORATION OF
IMPULSE CORRUPTED IMAGES

Figure 1 depicts the weight function (5) for A=2 

Figure 2 depicts image restoration from 40% impulse noise corrupted Lena image. Figure
2(a) shows the original Lena picture; Figure 2(b) shows the noisy image; Figure 2(c) shows
median filtering (3×3), PSNR=28.75; Figure 2(d) α-TMF (3×3), PSNR=27.49; Figure
2(e) VWTMF (3×3), PSNR=29.06; and Figure 2(f) VWTMF-switch (3×3),
PSNR=31.43.

A NEW NONLINEAR IMAGE FILTERING TECHNIQUE

Figure 1 Image restoration from 60% impulse noise. (a) Corrupted image. (b) Filtering
by Sun and Neuvo's median switch scheme. (c) Our filtering. (d) Our modified filtering.

Figure 2 Image restoration from 40% impulse noise. (a) Corrupted image. (b) Median
filtering (3×3). (c) Median filtering (5×5). (d) Our filtering.

BIOMEDICAL SIGNAL PROCESSING USING A NEW CLASS OF WAVELETS

Figure 1
Figure 2a
Figure 2b
Figure 3
Figure 4(a)
Figure 4(b)
Figure 4(c)

VISUAL MULTIRESOLUTION COLOR IMAGE RESTORATION

Figure 1
Figure 2
Figure 3

NONLINEAR QUINCUNX FILTERS

Figure 1
Figure 2
Figure 3
Figures 4, 4(a), 4(b)
Figures 5, 5(a), 5(b)
Figures 6(a-c)

MAMMOGRAM ENHANCEMENT USING GENERALIZED SINC WAVELETS

Figure 1. π band-limited interpolating wavelets (a) Sinc function (b) Sinclet wavelet

Figure 2. Fourier Gibbs overshoot of Sinc FIR implementation

Figure 3. Sinc Cutoff Wavelets (M=9) (a) Scaling (b) Wavelet (c) Dual scaling (d)
Dual wavelet

Figure 4. B-Spline Lagrange DAF Wavelets (N=5, h=3) (a) Scaling (b) Wavelet (c)
Dual scaling (d) Dual wavelet

Figure 5. Frequency Response of Equivalent Filters (N=5, h=3) (a) Decomposition (b)
Reconstruction

Figure 6. Mother Wavelet Comparison (N=4, h=2) Solid: B-spline Sinc; dotted:
Gaussian Sinc

Figure 7. Nonlinear Masking Functional (a) Donoho Hard Logic Nonlinearity (b)
Softer Logic Nonlinearity

Figure 8. Mammogram enhancement (a) Original mammogram (b) Multiresolution
enhancement by DAF-wavelet

VISUAL MULTIRESOLUTION COLOR IMAGE RESTORATION

Figure 1. Cube model of RGB color

Figure 2. Alternative representation

Figure 3. Hexagon projection of color tube

Figure 4. Tested Result (1) (a) Noisy Lena (b) Median filtering (c) VGN restoration

Figure 5. Tested Result (2) (a) Noisy girl (b) Median filtering (c) VGN restoration

DUAL PROPAGATION INVERSION OF FOURIER AND LAPLACE SIGNALS

Fig. 1 The auxiliary function $(t; a, \omega_0)$, at $t=0$, as a function of the frequency $\omega_0$.

Fig. 2 The truncated sine function $f(\omega)=\sin(\omega)$, $0\le\omega\le\pi$, and the calculated spectrum
obtained by the dual propagation inversion procedure. The noiseless time domain signal
was sampled between $-45\le t\le 45$. The two are visually indistinguishable.

Fig. 3 Dotted line: The calculated spectrum $f(\omega)$ obtained from the time signal corrupted
by random noise of 20%. Solid line: The calculated spectrum obtained from the
noise-free time signal. Both clean and corrupted signals were sampled between $-45\le t\le 45$.

Fig. 4 Cross-hatched line: the calculated spectrum $f_{DPI}(\omega)$ obtained from the noise-free
time domain signal, sampled between $-5\le t\le 5$. Solid line is the original truncated sine
function.

Fig. 5 Cross-hatched line: the calculated spectrum $f_{DPI}(\omega)$ obtained using the
DAF-padding values for $5\le|t|\le 7.5$, joined smoothly to the analytical tail-function (see
text). Careful comparison with Fig. 4 shows a reduction of the aliasing due to signal
truncation.

DETAILED DESCRIPTION OF THE INVENTION

The inventors have found that signals, images and multidimensional imaging
data can be processed at or near the uncertainty principle limits with DAFs and various
adaptations thereof, which are described in the various sections of this disclosure.


DAF TREATMENT OF NOISY SIGNALS 
INTRODUCTION 

Experimental data sets encountered in science and engineering contain noise due
to the influence of internal interferences and/or external environmental conditions.
Sometimes the noise must be identified and removed in order to see the true signal
clearly, to analyze it, or to make further use of it.

Signal processing techniques are now widely applied not only in various fields
of engineering but also in physics, chemistry, biology, and medicine. Example problems
of interest include filter diagonalization, solvers for eigenvalues and eigenstates [1-3],
solution of ordinary and partial differential equations [4-5], pattern analysis,
characterization, and denoising [6-7], and potential energy surface fitting [8]. One of the
most important topics in signal processing is filter design. Frequency-selective filters are
a particularly important class of linear, time-invariant (LTI) analyzers [9]. For a given
experimental data set, however, some frequency selective finite impulse response (FIR)
filters require a knowledge of the signal in both the unknown "past" and "future"
domains. This is a tremendously challenging situation when one attempts to analyze the
true signal values near the boundaries of the known data set. Direct application of this
kind of filter to the signal leads to "aliasing", i.e., the introduction of additional,
nonphysical frequencies to the true signal [9]. Additionally,
in the implementation of the fast Fourier transform (FFT) algorithm [9], it is desirable
for the number of data values or samples to be a power of 2. However, this condition
is often not satisfied for a given set of experimental measurements, so one must either
delete data points or augment the data by simulating, in some fashion, the unknown data.
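
As a small illustration of the sample-count condition, the following sketch computes the
next power-of-two length and the number of values that would have to be padded. The
helper name `pad_to_power_of_two` is hypothetical and is not part of this disclosure; the
220-sample input simply mirrors the 220 known values quoted in the description of Figure 6.

```python
def pad_to_power_of_two(n_known):
    """Return (total length, number of padded samples) for an FFT-friendly length.

    Hypothetical helper, for illustration only.
    """
    if n_known < 1:
        raise ValueError("n_known must be positive")
    # Smallest power of 2 that is greater than or equal to n_known.
    n_total = 1 << (n_known - 1).bit_length()
    return n_total, n_total - n_known


total, gap = pad_to_power_of_two(220)
print(total, gap)  # 256 36
```

With 220 known samples, 36 gap values bring the record to 256 points, the sizes used in
the periodic extension examples.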

Determining the true signal by extending the domain of experimental data is
extremely difficult without additional information concerning the signal, such as a
knowledge of the boundary conditions. It is an even tougher task, using the typical
interpolation approach, when the signal contains noise. Such interpolation formulae
necessarily return data values that are exact on the grid points; but they suffer a loss of
accuracy off the grid points or in the unknown domain, since they reproduce the noisy
signal data exactly, without discriminating between the true signal and the noise. In this
paper, an algorithm is introduced that makes use of the well-tempered property of
"distributed approximating functionals" (DAFs) [10-13]. The basic idea is to introduce
a pseudo-signal by adding gaps at the ends of the known data, and assuming the
augmented signal to be periodic. The unknown gap data are determined by solving linear
algebraic equations that extremize a cost function. This procedure thus imposes a
periodic boundary condition on the extended signal. Once periodic boundary conditions
are enforced, the pseudo-signal is known everywhere and can be used for a variety of
numerical applications. The detailed values in the gap are usually not of particular
interest. The advantage of the algorithm is that the extended signal adds virtually no
aliasing to the true signal, which is an important problem in signal processing. Two of
the main sources of aliasing are too small a sampling frequency and truncation of the
signal duration. Another source of error is contamination of the true signal by numerical
or experimental noise. Here we are concerned only with how to avoid the truncation
induced and noisy aliasing of the true signal.

Distributed approximating functionals (DAFs) were recently introduced [10-11] as
a means of approximating continuous functions from values known only on a discrete
sample of points, and of obtaining approximate linear transformations of the function (for
example, its derivatives to various orders). One interesting feature of a class of
commonly-used DAFs is the so-called well-tempered property [13]; it is the key to the
use of DAFs as the basis of a periodic extension algorithm. DAFs differ from the most
commonly used approaches in that there are no special points in the DAF approximation;
i.e., the DAF representation of a function yields approximately the same level of accuracy
for the function both on and off the grid points. However, we remark that the
approximation to the derivatives is not, in general, quite as accurate as the DAF
approximation to the function itself because the derivatives of $L^2$-functions contain an
increased contribution from high frequencies. By contrast, most other approaches yield
exact results for the function on the grid points, but often at the expense of the quality of
the results for the function elsewhere [13]. DAFs also provide a well-tempered
approximation to various linear transformations of the function. DAF representations of
derivatives to various orders will yield approximately similar orders of accuracy as long
as the resulting derivatives remain in the DAF class. The DAF approximation to a
function and a finite number of derivatives can be made to be of machine accuracy with
a proper choice of the DAF parameters. DAFs are easy to apply because they yield
integral operators for derivatives. These important features of DAFs have made them
successful computational tools for solving various linear and nonlinear partial differential
equations (PDEs) [14-17], for pattern recognition and analysis [6], and for potential
energy surface fitting [8]. The well-tempered DAFs also are low-pass filters. In this
paper, the usefulness of DAFs as low pass filters is also studied when they are applied
to a periodically extended noisy signal. For the present purpose, we assume that the weak
noise is mostly in the high frequency region and the true signal is bandwidth limited in
frequency space, and is larger than the noise in this same frequency region. To determine
when the noise is eliminated, we introduce a signature to identify the optimum DAF
parameters. This concept is based on computing the root-mean-square of the smoothed
data for given DAF parameters. By examining its behavior as a function of the DAF
parameters, it is possible to obtain the overall frequency distribution of the original noisy
signal. This signature helps us to periodically extend and filter noise in our test examples.
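
A minimal sketch of such a signature is given below, under stated assumptions: the data
are treated as one period of a periodic signal with unit grid spacing, the kernel is the
standard even-Hermite DAF with parameters σ and M, and the signature is taken to be the
root-mean-square of the DAF-smoothed data as a function of σ/Δ. The function names and
the synthetic test signal (a low-frequency sine plus a deterministic high-frequency
component standing in for noise) are illustrative and are not taken from this disclosure.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval


def hermite_daf(x, sigma, M):
    """Hermite DAF kernel delta_M(x|sigma), even M, physicists' Hermite polynomials."""
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    return np.exp(-y * y) * hermval(y, coeffs) / sigma


def periodic_daf_smooth(f, sigma, M):
    """Smooth one period of a signal (unit grid spacing) by circular convolution."""
    K = len(f)
    d = (np.arange(K) + K // 2) % K - K // 2  # wrapped distance to grid point 0
    kernel = hermite_daf(d, sigma, M)
    return np.real(np.fft.ifft(np.fft.fft(f) * np.fft.fft(kernel)))


def signature(f, sigmas, M):
    """Root-mean-square of the smoothed data for each sigma (in grid units)."""
    return np.array([np.sqrt(np.mean(periodic_daf_smooth(f, s, M) ** 2))
                     for s in sigmas])


i = np.arange(256)
signal = np.sin(2 * np.pi * 3 * i / 256)                  # band-limited "true" signal
noisy = signal + 0.5 * np.sin(2 * np.pi * 60 * i / 256)   # high-frequency stand-in for noise
sig = signature(noisy, sigmas=range(2, 11), M=12)
print(np.round(sig, 4))
```

In this sketch the signature falls as σ/Δ increases and then levels off near the
root-mean-square of the low-frequency component, which is the kind of plateau one would
look for when choosing the smoothing parameters.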

The first example is a simple, noisy periodic signal, for which the DAF periodic
extension is a special case of extrapolation. The second is a nonperiodic noisy signal.
After performing the periodic extension and filtering, it is seen that the resulting signal
closely recreates the true signal.



DISTRIBUTED APPROXIMATING FUNCTIONALS 

DAFs can be viewed as "approximate identity kernels" used to approximate a
continuous function in terms of discrete sampling on a grid [10-13]. One class of DAFs
that has been particularly useful is known as the well-tempered DAFs, which provide an
approximation to a function having the same order of accuracy both on and off the grid
points. A particularly useful member of this class of DAFs is constructed using Hermite
polynomials, and prior to discretization, is given by

$$\delta_M(x-x'|\sigma) = \frac{1}{\sigma}\exp\left(-\frac{(x-x')^2}{2\sigma^2}\right)\sum_{n=0}^{M/2}\left(-\frac{1}{4}\right)^n\frac{1}{\sqrt{2\pi}\,n!}\,H_{2n}\left(\frac{x-x'}{\sqrt{2}\,\sigma}\right), \tag{1}$$

where $\sigma$, $M$ are the DAF parameters and $H_{2n}$ is the usual (even) Hermite polynomial. The
Hermite DAF is dominated by its Gaussian envelope, $\exp(-(x-x')^2/2\sigma^2)$, which effectively
determines the extent of the function. The continuous, analytic approximation to a
function $f(x)$ generated by the Hermite DAF is

$$f(x) \approx f_{DAF}(x) = \int_{-\infty}^{\infty}\delta_M(x-x'|\sigma)\,f(x')\,dx'. \tag{2}$$

Given a discrete set of functional values on a grid, the DAF approximation to the
function at any point $x$ (on or off the grid) can be obtained by

$$f_{DAF}(x) = \Delta\sum_{j}\delta_M(x-x_j|\sigma)\,f(x_j), \tag{3}$$

where $\Delta$ is the uniform grid spacing (non-uniform and even random sampling can also
be used by an appropriate extension of the theory). The summation is over all grid points
(but only those close to $x$ effectively contribute). Similarly, for a two-dimensional
function $f(x,y)$, one can write

$$f_{DAF}(x,y) = \Delta_x\Delta_y\sum_{j_1,j_2}\delta_{M_1}(x-x_{j_1}|\sigma_1)\,\delta_{M_2}(y-y_{j_2}|\sigma_2)\,f(x_{j_1},y_{j_2}), \tag{4}$$

using a simple direct product. In Figure 1, we plot Hermite DAFs obtained with two
different sets of parameters, in (a) coordinate space, and (b) frequency space. The solid
line ($\sigma=3.05$, $M=88$) is more interpolative compared to the DAF given by the dashed line
($\sigma=4$, $M=12$). The latter is more smoothing when applied to those functions whose
Fourier composition lies mostly under the $\sigma=3.05$, $M=88$ DAF window. This results
from the $\sigma=4$, $M=12$ DAF window being narrower in Fourier space than that of the DAF
with $\sigma=3.05$, $M=88$. The discretized Hermite DAF is highly banded in coordinate space
due to the presence of the Gaussian envelope, which means that only a relatively small
number of values are needed on both sides of the point $x$ in Eq. (3), as can be clearly seen
from Figure 1(a). This is in contrast to the sinc function, $\sin(\omega x)/(\pi x)$. From Figure 1(b),
we see that the Hermite DAF is also effectively bandwidth-limited in frequency space.
With a proper choice of parameters, the Hermite DAF can produce an arbitrarily good
filter (see the dashed line in Figure 1). Once the boundary condition is fixed for a data set,
Eq. (3) or (4) can then be used to eliminate the high frequency noise of that data set. As
long as the frequency distribution of the noise lies outside the DAF plateau (Figure 1(b)),
the Hermite DAF will eliminate the noise regardless of its magnitude.
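
The discrete approximation of Eq. (3) can be sketched in a few lines. The code below is
an illustration only: it assumes the standard even-Hermite form of the DAF kernel with
physicists' Hermite polynomials, and the function names, grid, and test function
($\sin x$) are choices made for this example rather than values from this disclosure.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval


def hermite_daf(x, sigma, M):
    """Hermite DAF kernel delta_M(x|sigma), even M, physicists' Hermite polynomials."""
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    return np.exp(-y * y) * hermval(y, coeffs) / sigma


def daf_approx(x_eval, grid, f_grid, sigma, M):
    """Eq. (3): f_DAF(x) = Delta * sum_j delta_M(x - x_j | sigma) f(x_j)."""
    x_eval = np.atleast_1d(np.asarray(x_eval, dtype=float))
    delta = grid[1] - grid[0]  # uniform grid spacing
    kernel = hermite_daf(x_eval[:, None] - grid[None, :], sigma, M)
    return delta * (kernel @ f_grid)


grid = np.arange(-5.0, 5.0 + 1e-9, 0.1)           # uniform grid, spacing 0.1
x_off = np.linspace(-1.0, 1.0, 41) + 0.037        # evaluation points off the grid
approx = daf_approx(x_off, grid, np.sin(grid), sigma=0.4, M=12)
max_err = np.max(np.abs(approx - np.sin(x_off)))
print(max_err)
```

The evaluation points are deliberately placed off the grid and well away from the ends of
the sampled interval, so the comparison probes the well-tempered behavior rather than
boundary truncation.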

The approximate linear transformations of a continuous function can also be
generated using the Hermite DAF. One particular example is derivatives of a function to
various orders, given by

$$f^{(l)}(x) \approx f^{(l)}_{DAF}(x) = \int_{-\infty}^{\infty}\delta_M^{(l)}(x-x'|\sigma)\,f(x')\,dx', \tag{5}$$

where $\delta_M^{(l)}(x-x'|\sigma)$ is the $l$th derivative of $\delta_M(x-x'|\sigma)$ with respect to $x$, and
is given explicitly by

$$\delta_M^{(l)}(x-x'|\sigma) = \frac{2^{-l/2}}{\sigma^{l+1}}\exp\left(-\frac{(x-x')^2}{2\sigma^2}\right)\sum_{n=0}^{M/2}\left(-\frac{1}{4}\right)^n\frac{(-1)^l}{\sqrt{2\pi}\,n!}\,H_{2n+l}\left(\frac{x-x'}{\sqrt{2}\,\sigma}\right). \tag{6}$$

When uniformly discretized by quadrature, Eq. (5) gives

$$f^{(l)}(x) \approx f^{(l)}_{DAF}(x) = \Delta\sum_{j}\delta_M^{(l)}(x-x_j|\sigma)\,f(x_j). \tag{7}$$

Expressions (5) and (7) are extremely useful in solving linear and nonlinear partial
differential equations (PDEs) [14-17] because the differential operation has been
expressed as an integration. With a judicious choice of the DAF parameters, it is possible
to provide arbitrarily high accuracy for estimating the derivatives.
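
The derivative expressions can be checked numerically with a short sketch. It assumes
the standard even-Hermite DAF, whose $l$th derivative carries the prefactor
$2^{-l/2}/\sigma^{l+1}$, the sign $(-1)^l$, and the Hermite polynomial $H_{2n+l}$; the function
names, grid, and test function are illustrative choices, not values from this disclosure.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval


def hermite_daf_derivative(x, sigma, M, l):
    """l-th derivative of the Hermite DAF kernel with respect to x (l = 0 gives the kernel)."""
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + l + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n + l] = ((-0.25) ** n * (-1.0) ** l
                             / (math.sqrt(2.0 * math.pi) * math.factorial(n)))
    prefactor = 2.0 ** (-l / 2.0) / sigma ** (l + 1)
    return prefactor * np.exp(-y * y) * hermval(y, coeffs)


def daf_derivative(grid, f_grid, sigma, M, l):
    """Discretized derivative: Delta * sum_j delta_M^(l)(x - x_j | sigma) f(x_j)."""
    delta = grid[1] - grid[0]
    kernel = hermite_daf_derivative(grid[:, None] - grid[None, :], sigma, M, l)
    return delta * (kernel @ f_grid)


grid = np.arange(-5.0, 5.0 + 1e-9, 0.1)
interior = np.abs(grid) <= 1.0                     # stay away from the grid ends
d1 = daf_derivative(grid, np.sin(grid), sigma=0.4, M=12, l=1)
d2 = daf_derivative(grid, np.sin(grid), sigma=0.4, M=12, l=2)
err1 = np.max(np.abs(d1[interior] - np.cos(grid[interior])))
err2 = np.max(np.abs(d2[interior] + np.sin(grid[interior])))
print(err1, err2)
```

Only interior points are compared, because near the ends of a finite grid the kernel is
truncated and the quadrature is incomplete.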



METHOD OF DATA EXTRAPOLATION
Case I. Filling a Gap

Suppose we have a set of uniformly spaced grid points on the infinite line, and
a continuous function, $f(x)$, known on all grid points except for the set $\{x_J, \ldots, x_K\}$.
Assuming that $f(x)$ is in the DAF-class, we can estimate the unknown values by
minimizing the cost function,

$$C = \sum_{p=-\infty}^{\infty} W_p\left(f(x_p) - f_{DAF}(x_p)\right)^2, \tag{8}$$

where $W_p$ is a weight assigned to the point $x_p$, and in this paper it is chosen to be 1 on a
finite grid and 0 elsewhere; $f_{DAF}(x_p)$ is the DAF approximation to the function at the point
$x_p$. From Eqs. (3) and (8), we have

$$C = \sum_{p=-\infty}^{\infty} W_p\left(f(x_p) - \sum_{t=p-w}^{p+w}\delta_M(x_p-x_t|\sigma)\,f(x_t)\right)^2, \tag{9}$$

where $w$ is the half DAF bandwidth and the grid spacing $\Delta$ of Eq. (3) is taken to be
absorbed into $\delta_M$. We minimize this cost function with respect to the
unknown values, $\{f(x_J), \ldots, f(x_K)\}$, according to

$$\frac{\partial C}{\partial f(x_l)} = 0, \quad J \le l \le K, \tag{10}$$

to generate the set of linear algebraic equations,

$$\sum_{p=l-w}^{l+w} 2W_p\left(f(x_p) - \sum_{t=p-w}^{p+w}\delta_M(x_p-x_t|\sigma)\,f(x_t)\right)\left(\delta_{pl} - \delta_M(x_p-x_l|\sigma)\right) = 0, \quad J \le l \le K, \tag{11}$$

where the unknowns are $f(x_p)$ and $f(x_t)$ for $p=l$ or $t=l$. The symbol $\delta_{pl}$ is the Kronecker
delta. Solving these equations yields the predicted values of $f(x)$ on the grid points in the
gap.
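
The gap-filling step can be sketched as a small least-squares problem. The code below is
a minimal illustration under stated assumptions: it uses the standard even-Hermite DAF,
keeps the grid spacing $\Delta$ explicit in the DAF matrix, sets the weights to 1 on interior
rows and 0 near the ends of the finite grid, and calls a generic least-squares solver, which
is equivalent to solving the normal equations obtained by setting the derivatives of the
cost function to zero. The function names, parameters, and test signal are hypothetical.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval


def hermite_daf(x, sigma, M):
    """Hermite DAF kernel delta_M(x|sigma), even M, physicists' Hermite polynomials."""
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    return np.exp(-y * y) * hermval(y, coeffs) / sigma


def fill_gap(f, gap, delta, sigma, M, margin):
    """Estimate f on the indices `gap` by minimizing sum_p W_p (f_p - f_DAF,p)^2."""
    n = len(f)
    idx = np.arange(n)
    D = delta * hermite_daf((idx[:, None] - idx[None, :]) * delta, sigma, M)
    A = np.eye(n) - D                      # residual operator: A @ f = f - f_DAF
    rows = idx[margin:n - margin]          # W_p = 1 on these rows, 0 elsewhere
    known = np.setdiff1d(idx, gap)
    rhs = -A[np.ix_(rows, known)] @ f[known]
    sol, *_ = np.linalg.lstsq(A[np.ix_(rows, gap)], rhs, rcond=None)
    filled = np.array(f, dtype=float)
    filled[gap] = sol
    return filled


delta = 0.05
x = np.arange(400) * delta
truth = np.sin(x)
gap = np.arange(195, 205)                  # ten unknown grid values
data = truth.copy()
data[gap] = 0.0                            # placeholder values, never read by fill_gap
filled = fill_gap(data, gap, delta, sigma=8 * delta, M=6, margin=100)
gap_err = np.max(np.abs(filled[gap] - truth[gap]))
print(gap_err)
```

The interior-only weighting keeps rows whose DAF sum would run off the finite grid out of
the cost function, so the fit is driven only by rows where the approximation is complete.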

Case II. Extrapolation

A more challenging situation occurs when $f(x_l)$, $l > J$, are all unknown. In this case,
for points $x_p$ beyond $x_K$, we specify a functional form for the unknown grid values,
including some embedded variational parameters. It is simplest to choose linear
variational parameters, e.g.,

$$f(x) = \sum_{\beta=1}^{L}\alpha_\beta\,\phi_\beta(x), \tag{12}$$

but this is not essential, and nonlinear parameters can also be embedded in the $\phi_\beta(x)$. The
choice of functions, $\phi_\beta(x)$, can be guided by any intuition or information about the
physical behavior of the signal, but even this is not necessary. This introduces additional
variations of the cost function with respect to the additional parameters, so we impose
$\partial C/\partial\alpha_\beta = 0$, $1 \le \beta \le L$, and therefore obtain additional equations. We must also specify the
choice of the $W_p$ when one introduces both a gap and a "tail function". There is enormous
freedom in how this is done, and, e.g., one can choose which points are included and
which are not. In the present study, we shall take $W_p = 1$ for $1 \le p \le K$ (i.e., all known data
points, plus all gap-points), and $W_p = 0$ for all other points (including tail-function points).
Again, we emphasize that other choices are possible and are under study.

For case I, our procedure leads to Eq. (11) and for case II, to the equations,

$$\sum_{p=l-w}^{\min(l+w,K)} 2W_p\left(f(x_p) - \sum_{t=p-w}^{p+w}\delta_M(x_p-x_t|\sigma)\,f(x_t)\right)\left(\delta_{pl} - \delta_M(x_p-x_l|\sigma)\right) = 0, \quad J \le l \le K, \tag{13}$$

$$\sum_{p=K+1-w}^{K} -2W_p\left(f(x_p) - \sum_{t=p-w}^{p+w}\delta_M(x_p-x_t|\sigma)\,f(x_t)\right)\sum_{t=K+1}^{p+w}\delta_M(x_p-x_t|\sigma)\,\phi_\beta(x_t) = 0, \quad 1 \le \beta \le L. \tag{14}$$

These linear algebraic equations can be solved by any of a variety of standard algorithms
[18]. Note that it is the well-tempered property of the DAFs that underlies the above
algorithms. For standard interpolation algorithms, the value on each grid point is exact
and does not depend on the values at other grid points, which means that the cost function
is always zero irrespective of functional values.

The suitability of using Hermite DAFs to pad two isolated data sets has been
tested for fitting one dimensional potential energy surfaces [8]. To explore further the
algorithm in the case where only one data set is known, we show in Figure 2 the
extrapolation results for the arbitrarily chosen DAF-class function,

$$f(x) = 4 + 2\sum_{j=1}^{4}\cos(jx) + 5\exp\left(-(x-1)^2\right). \tag{15}$$

Using values of the function on a discrete grid with uniform spacing, $\Delta \approx 0.024$, on the
domain shown in Figure 2 (solid line), we attempt to determine the function at 100
uniformly distributed grid points in the range [-1.2, 1.2]. The tail function used is $f(x) \equiv 1$,
multiplied by a linear variational parameter. From Figure 2, it is seen that the predicted
results are in almost total agreement with the actual function, for all three DAF
parameters employed, for the points in $-1.2 \le x \le 0.2$. Larger errors occur for those
$x$ values which are further away from the known data boundary. The source of error
simply is that one is forcing the function to join smoothly with a constant tail function,
even though the constant is variationally determined. Had one employed the correct form
for the tail function, with a multiplicative variational factor, the result would be visually
indistinguishable for all three DAF parameters and the tail-variational constant would
turn out to be essentially unity. It must be noted that, although we have discussed the
algorithm in the context of one dimension, extending it to two or more dimensions is
straightforward. One way to do this is with a direct product form, as given in Eq. (4).
However, such a direct two dimensional calculation is a time and memory consuming
procedure because of the large number of simultaneous algebraic equations that must be
solved. One alternative is to consider the two dimensional patterns as a grid constructed
of many one dimensional grids, and then extrapolate each line separately. We expect this
procedure may be less accurate than the direct two dimensional extrapolation algorithm
because it only considers the influence from one direction and neglects cross correlation.
However, for many problems it produces satisfactory results and it is a very economical
procedure. Additionally, cross correlation can be introduced by DAF-fitting the complete
(known plus predicted) data set using the appropriate 2D DAF.
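
Because the two dimensional DAF of Eq. (4) is a direct product, fitting a complete 2D data
set reduces to two matrix multiplications, one per direction. The sketch below illustrates
this under stated assumptions: the standard even-Hermite DAF is used, the function names
are hypothetical, and the test surface $\sin(x)\cos(y)$ is chosen only for the example.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval


def hermite_daf(x, sigma, M):
    """Hermite DAF kernel delta_M(x|sigma), even M, physicists' Hermite polynomials."""
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    return np.exp(-y * y) * hermval(y, coeffs) / sigma


def daf_matrix(n, delta, sigma, M):
    """Matrix with entries Delta * delta_M(x_p - x_t | sigma) on a uniform grid."""
    idx = np.arange(n)
    return delta * hermite_daf((idx[:, None] - idx[None, :]) * delta, sigma, M)


def daf_fit_2d(F, dx, dy, sigma_x, sigma_y, M_x, M_y):
    """Direct-product DAF fit on the grid: apply the 1D DAF along x, then along y."""
    Dx = daf_matrix(F.shape[0], dx, sigma_x, M_x)
    Dy = daf_matrix(F.shape[1], dy, sigma_y, M_y)
    return Dx @ F @ Dy.T


x = np.arange(120) * 0.1
y = np.arange(120) * 0.1
F = np.outer(np.sin(x), np.cos(y))
F_daf = daf_fit_2d(F, 0.1, 0.1, sigma_x=0.4, sigma_y=0.4, M_x=12, M_y=12)
core = slice(50, 70)                                # interior block, away from the edges
err_2d = np.max(np.abs(F_daf[core, core] - F[core, core]))
print(err_2d)
```

The separable form costs two dense products rather than one product with a matrix over all
pairs of 2D grid points, which is why it remains economical for image-sized data.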

The well-tempered property makes the DAFs powerful computational tools for 
extrapolation of noisy data, since DAFs are low-pass filters and therefore remove high 
frequency noise. In the next section, we will explore the use of the algorithm presented 
here for periodically extending a finite, discrete segment of data which may contain 
noise. 

PERIODIC EXTENSION 

As described in the introduction to this paper, sometimes it is necessary to know 
the boundary conditions for a data set in order to apply noncausal, zero-phase FIR filters 
without inducing significant aliasing. Certain other numerical analyses require that the 
signal contain a number of samples equal to an integer power of 2. However, it is most 
often the case that the boundary conditions for the experimental data are unknown and 
the length of the data stream is fixed experimentally and typically not subject to 
adjustment. 

A pseudo-signal is introduced outside the domain of the nonperiodic experimental
signal in order to force the signal to satisfy periodic boundary conditions and/or to have
the appropriate number of samples. The required algorithm is similar to that for filling
a gap, as discussed above. One can treat the period length as a discrete variational
parameter, but we do not pursue this here. For a given set of experimental data
$\{f_1, f_2, \ldots, f_{J-1}\}$, we shall force it to be periodic, with period $K$, so that the $K-J+1$ values
$\{f_J, f_{J+1}, \ldots, f_K\}$ must be determined. Since the extended signal is periodic, the values
$\{f_{K+1}, f_{K+2}, \ldots, f_{K+J-1}\}$ are also, by fiat, known to be equal to $\{f_1, f_2, \ldots, f_{J-1}\}$.
Once the gap is filled in, the resulting signal can, of course, be infinitely extended as
may be required for various numerical applications.
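
The index bookkeeping of the periodic extension can be sketched as follows. This is a minimal illustration, assuming 1-based sample indices as in the text; the helper names (`gap_indices`, `next_power_of_two`, `periodic_value`) are hypothetical, and the variational DAF fit that actually fills the gap is not shown.

```python
def next_power_of_two(n):
    """Smallest power of two that is >= n (a common choice for the period K)."""
    p = 1
    while p < n:
        p *= 2
    return p


def gap_indices(num_known, period):
    """1-based indices J..K of the samples that must be determined.

    num_known is J-1 (the number of measured samples) and period is K.
    """
    first_unknown = num_known + 1  # this is J
    if period < first_unknown:
        raise ValueError("period must exceed the number of known samples")
    return list(range(first_unknown, period + 1))


def periodic_value(one_period, n):
    """Value of the infinitely extended signal at 1-based index n.

    one_period holds f_1..f_K after the gap has been filled in.
    """
    return one_period[(n - 1) % len(one_period)]


# 220 measured samples padded to a period of 256 leaves 36 values to determine.
K = next_power_of_two(220)
gap = gap_indices(220, K)
```

Choosing the period as a power of two is only one option; any period longer than the measured record fits the same bookkeeping.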

The pseudo-signal is only used to extend the data periodically retaining 
essentially the same frequency distributions. The utility of the present periodic extension 
algorithm is that it provides an artificial boundary condition for the signal without
significant aliasing. The resulting signal can be used with any filter requiring information
concerning the future and past behavior of the signal. In this paper, we also employ a
Hermite DAF to filter out the higher frequency noise of the periodic, extended noisy 
signal. For doing this, we assume that the true signal is bandwidth limited and that the 
noise is mostly concentrated in the high frequency region. 

As shown in the test example extrapolation in section III, there are infinitely
many ways to smoothly connect two isolated signals using DAFs with different choices
of the parameters. We require a procedure to determine the optimum DAF parameters in
a "blind" manner. Fourier analysis is one way to proceed, but due to the structure of the
Hermite DAF, we prefer to optimize the parameters while working in physical space
rather than Fourier space. To accomplish this, we introduce a generalized signature for
both noisy data extension and filtering, which we define to be



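$$S_M(\sigma/\Delta) = \frac{\sum_n \left( f_{DAF}(x_n) - \bar{f}_{DAF} \right)^2}{N} \qquad (16)$$

where $M$ and $\sigma$ are Hermite DAF parameters, and $\bar{f}_{DAF}$ is the arithmetic average of the
$f_{DAF}(x_n)$ (the $\sigma/\Delta \to \infty$ limit of $f_{DAF}(x_n)$). The signature essentially measures the smoothness of
the DAF filtered result.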
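
A small sketch may help make the signature concrete. It assumes the signature is the mean squared deviation of the filtered values about their arithmetic average, and it uses a periodic Gaussian-weighted average as a stand-in smoother. That smoother is an assumption made purely for illustration and is not the Hermite DAF; the function names are hypothetical.

```python
import math


def signature(values):
    """Mean squared deviation of the values about their arithmetic average."""
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / n


def gaussian_smooth_periodic(values, width):
    """Periodic Gaussian-weighted average (stand-in low-pass filter, NOT a DAF)."""
    n = len(values)
    half = min(n // 2, max(1, int(math.ceil(4 * width))))
    offsets = list(range(-half, half + 1))
    weights = [math.exp(-0.5 * (k / width) ** 2) for k in offsets]
    total = sum(weights)
    return [
        sum(w * values[(i + k) % n] for w, k in zip(weights, offsets)) / total
        for i in range(n)
    ]


# Scan the smoothing width and record the signature of each filtered result.
noisy = [math.sin(2 * math.pi * n / 64) + 0.3 * (-1) ** n for n in range(64)]
scan = [(w, signature(gaussian_smooth_periodic(noisy, w))) for w in (0.5, 1.0, 2.0, 4.0)]
```

Plotting the second entry of each pair in `scan` against the first gives the kind of curve the text analyzes; the location of any plateau depends on the filter actually used.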

A typical plot of $S_M(\sigma/\Delta)$ is shown in Figure 4(b). We first note that it is
monotonically decreasing. This is to be expected since increasing $\sigma/\Delta$ results in a
smoother, more highly averaged signal. The second major feature of interest is the
occurrence of a broad plateau. In this region most of the noise has been removed from
the DAF approximation; however, the dominant portion of the true signal is still
concentrated under the DAF frequency window. As a consequence, the DAF
approximation to the function is very stable in this region. As $\sigma/\Delta$ increases beyond the
plateau, the width of the DAF window in frequency no longer captures the true signal and,
as a result, the true DAF signal begins to be severely averaged. In the extreme, only the
zero frequency remains, $f_{DAF}(x_n) = \bar{f}_{DAF}$, and hence $S_M(\sigma/\Delta) \to 0$. As we discuss
below, one can usefully correlate the transition behavior with the best
DAF approximation. The first extremely rapid decrease is due to the fact that the DAF is
interpolating and not well tempered. It is the region beyond the initial rapid decrease that
is important (i.e., $\sigma/\Delta \geq 1.5$). To understand the behavior in this region, we write $S_M(\sigma/\Delta)$
in the form



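$$S_M(\sigma/\Delta) = \frac{\sum_n \left[ (\Delta f(x_n))_{DAF}^2 + 2\,(\Delta f(x_n))_{DAF} \left( f_{DAF}^{(p)}(x_n) - \bar{f}_{DAF} \right) + \left( f_{DAF}^{(p)}(x_n) - \bar{f}_{DAF} \right)^2 \right]}{N} \qquad (17)$$

where $f_{DAF}^{(p)}(x_n)$ is the DAF approximation using a $\sigma/\Delta$ in the middle of the plateau
and $(\Delta f(x_n))_{DAF} = f_{DAF}(x_n) - f_{DAF}^{(p)}(x_n)$. The cross term averages to zero because the DAF
approximation is interpolating and hence the $(\Delta f(x_n))_{DAF}$ fluctuate rapidly, reflecting the
presence of noise. Thus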



$$S_M(\sigma/\Delta) = \frac{\sum_n \left[ (\Delta f(x_n))_{DAF}^2 + \left( f_{DAF}^{(p)}(x_n) - \bar{f}_{DAF} \right)^2 \right]}{N} \qquad (18)$$

which decreases rapidly since $\sum_n (\Delta f(x_n))_{DAF}^2$ is positive and rapidly decreasing as the
high frequency noise is eliminated from the signal. The transition into the plateau reflects
a change from an interpolative to a well tempered behavior. Although the algorithm
presented in this section only refers to periodic extensions, we stress that this is only one
possibility out of many.

NUMERICAL EXAMPLES

Two numerical examples are presented in this section to show the usefulness of 
our algorithm. 

Case I
The first one is the extrapolation of $f(x) = \sin(5\pi x/128)$, to which noise has been
added. The Hermite DAF parameters are $M=6$ for padding/extension and $M=12$ for
smoothing in our numerical examples. The weight $W_p$ was taken as discussed above. The
values at 220 evenly spaced grid points are input over the range [0,219], with 50%
random noise added ($f = f \times [1 + \mathrm{random}(-0.5, 0.5)]$). The continuation of the solid curve
from points $x_{220}$ to $x_{256}$ shows the function without noise. We shall predict the remaining
36 points (excluding $x_{256}$ because the function there must equal the function at $x_0$) by the
periodic extension algorithm presented in this paper. Because the original continuous
function without noise is truly periodic, with period 256, this extension corresponds to
filling the gap using noisy input data. The $L_\infty$ error and the signature for periodic
extension are plotted with respect to $\sigma/\Delta$ in Figures 4(a) and 4(b) respectively. From
Figure 4(b), we see that at $\sigma/\Delta \approx 10.5$, the transition from the plateau to smoothing of the
true signal occurs. As is evident from Figure 4(a), the minimum extension error also
occurs at $\sigma/\Delta$ around 10.5. In Figure 3, we see the comparison of the
true function (solid line) from the 220th to the 256th grid point, along with the periodic
extension result ("+" symbols). It is clearly seen that they agree very well.

We next use the padded, extended signal for low pass filtering. We plot in Figures
5(a) and 5(b) the $L_\infty$ error and the signature of the filtered result for one complete period
as a function of $\sigma/\Delta$ using $M=12$ rather than $M=6$ in the DAF. This is done for
convenience for reasons not germane to the subject. The result is that the $\sigma/\Delta$ range for
which the DAF is well-tempered changes and the transition from denoising to signal
modifying smoothing occurs at $\sigma/\Delta \approx 17$ (Figure 5(b)). However, we see from Figure
5(a) that the $L_\infty$ minimum error also occurs at about the same $\sigma/\Delta$, showing the robustness
of the approach with respect to the choice of DAF parameters. In Figure 5(c), we show
the resulting smoothed, denoised sine function compared to the original true signal.

These results illustrate the use of the DAF procedure in accurately extracting a
band-limited function using noisy input data. Because of the relatively broad nature of
the $L_\infty$ error near the minimum, one does not need a highly precise value of $\sigma/\Delta$.
Case II 

We now consider a more challenging situation. It often happens in experiments
that the boundary condition of the experimental signal is not periodic, and is unknown,
in general. However, the signal is approximately band-limited (i.e., in the DAF class).
To test the algorithm for this case, we use the function given in Eq. (15) as an
example. Figure 6(a) shows the function, with 20% random noise in the range (-7,10)
(solid line). These noisy values are assumed known at only 220 grid points in this range.
Also plotted in Figure 6(a) are the true values of the function (dashed line) on the 36
points to be predicted. In our calculations, these are treated, of course, as unknown and
are shown here only for reference. It is clearly seen that the original function is not
periodic on the range of 256 grid points. We force the noisy function to be periodic by
padding the values of the function on these last 36 points, using only the known, noisy
220 values to periodically surround the gap.

As mentioned in section IV, for nonperiodic signals, the periodic extension is
simply a scheme to provide an artificial boundary condition in a way that does not
significantly corrupt the frequency distribution of the underlying true signal in the
sampled region. The periodic padding signature is shown in Figure 7. Its behavior is
similar to that of the truly periodic signal shown in Figure 4(b). The second rapid
decrease begins at about $\sigma/\Delta = 5.2$ and the periodic padding result for this DAF parameter
is plotted in Figure 6(b) along with the original noisy signal. Compared with the original
noisy signal, it is seen that the signal in the extended domain is now smoothed. In order
to see explicitly the periodic property of the extended signal and the degree to which
aliasing is avoided, we filter the noise out of the first 220 points using an appropriate
Hermite DAF. The $L_\infty$ error and the signature of the DAF-smoothed results are plotted
in Figures 8(a) and 8(b) respectively. Again they correlate with each other very well.
Both the minimum error and the starting point of the second rapid decrease occur at about
$\sigma/\Delta = 9.0$, which further confirms our analysis of the behavior of the signature. In Figure
9, we present the smoothed signal (solid line) along with the true signal (dashed line),
without any noise added. It is seen that, in general, they agree with each other very well
in the original input signal domain.

Application of the pseudo-signal in the extended domain clearly and effectively
avoids the troublesome aliasing problem. The minor errors observed occur in part
because the random noise contains not only high frequency components
but also some lower frequency components. However, the Hermite DAF is used only as
a low pass filter here. Therefore, any low frequency noise components are left untouched
in the resulting filtered signal. Another factor which may affect the accuracy of filtering
is that we only use $M=12$. According to previous analysis of the DAF theory, the higher
the $M$ value, the greater the accuracy [12], but at the expense of increasing the DAF
bandwidth ($\sigma/\Delta$ increases). However, as $M$ is increased, combined with the appropriate
$\sigma/\Delta$, the DAF window is better able to simulate an ideal band-pass filter (while still being
infinitely smooth and with exponential decay in physical and Fourier space). Here we
have chosen to employ $M=12$ because our purpose is simply to illustrate the use of our
algorithm, and an extreme accuracy algorithm incorporating these principles is directly
achievable.

CONCLUSIONS AND DISCUSSIONS 

This paper presents a DAF-padding procedure for periodically extending a 
discrete segment of a signal (which is nonperiodic). The resulting periodic signal can be
used in many other numerical applications which require periodic boundary conditions 
and/or a given number of signal samples in one period. The power of the present 
algorithm is that it essentially avoids the introduction of aliasing into the true signal. It 
is the well-tempered property of the DAFs that makes them robust computational tools 
for such applications. Application of an appropriate well-tempered DAF to the 
periodically extended signal shows that they are also excellent low pass filters. Two 
examples are presented to demonstrate the use of our algorithm. The first one is a 
truncated noisy periodic function. In this case, the extension is equivalent to an 
extrapolation. 

Our second example shows how one can perform a periodic extension of a 
nonperiodic, noisy finite-length signal. Both examples demonstrate that the algorithm 
works very well under the assumption that the true signal is continuous and smooth. In 
order to determine the best DAF parameters, we introduce a quantity called the signature. 
It works very well both for extensions and low pass filtering. By examining the behavior 
of the signature with respect to $\sigma/\Delta$, we can determine the overall frequency distribution
of the original noisy signal working solely in physical space, rather than having to
transform to Fourier space.

GENERALIZED SYMMETRIC INTERPOLATING WAVELETS
INTRODUCTION 

The theory of interpolating wavelets based on a subdivision scheme has attracted
much attention recently [1, 9, 12, 13, 17, 22, 27, 29, 40, 42, 45, 46, 47, 48, 49, 54, 55, 56,
65 and 66]. Because the digital sampling space is exactly homomorphic to the multiscale
spaces generated by interpolating wavelets, the wavelet coefficients can be obtained from
linear combinations of discrete samples rather than from traditional inner product
integrals. This parallel computational scheme significantly decreases the computational
complexity and leads to an accurate wavelet decomposition, without any pre-conditioning
or post-conditioning processes. Mathematically, various interpolating wavelets can be
formulated in a biorthogonal setting.

Following Donoho's interpolating wavelet theory [12], Harten has described a
kind of piecewise biorthogonal wavelet construction method [17]. Sweldens
independently developed this method as the well-known "lifting scheme" [56], which can
be regarded as a special case of the Neville filters considered in [27]. The lifting scheme
enables one to construct custom-designed biorthogonal wavelet transforms by just
assuming a single low-pass filter (a smoothing operation) without iterations. Theoretically,
the interpolating wavelet theory is closely related to the finite element technique in the
numerical solution of partial differential equations, the subdivision scheme for
interpolation and approximation, multi-grid generation and surface fitting techniques.

A new class of generalized symmetric interpolating wavelets (GSIW) is
described, which are generated from a generalized, window-modulated interpolating
shell. Taking advantage of various interpolating shells, such as Lagrange polynomials
and the Sinc function, etc., bell-shaped, smooth window modulation leads to wavelets
with arbitrary smoothness in both time and frequency. Our method leads to a powerful
and easily implemented series of interpolating wavelets. Generally, this novel design
technique can be extended to generate other non-interpolating multiresolution analyses
as well (such as the Hermite shell). Unlike the biorthogonal solution discussed in [6], we
do not attempt to solve a system of algebraic equations explicitly. We first choose an
updating filter, and then solve the approximation problem, which is an rth-order-accurate
reconstruction of the discretization. Typically, the approximating functional is a
piecewise polynomial. If we use the same reconstruction technique at all the points and
at all levels of the dyadic sequence of uniform grids, the prediction will have a
Toeplitz-like structure.

These ideas are closely related to the distributed approximating functionals
(DAFs) used successfully in computational chemistry and physics [20, 21, 22, 65, 66,
67], for obtaining accurate, smooth analytical fits of potential-energy surfaces in both
quantum and classical dynamics calculations, as well as for the calculation of the
state-to-state reaction probabilities for three-dimensional (3-D) reactions. DAFs provide a
numerical method for representing functions known only on a discrete grid of points. The
underlying function or signal (image, communication, system, or human response to
some probe, etc.) can be a digital time sequence (i.e., finite in length and 1 dimensional),
a time and spatially varying digital sequence (including 2-D images that can vary with
time, 3-D digital signals resulting from seismic measurements), etc. The general structure
of the DAF representation of the function, $f(x,t)$, where $x$ can be a vector (i.e., not just
a single variable), is

$$f_{DAF}(x, t_p) = \sum_j \phi(x - x_j \,|\, \sigma/\Delta)\, f(x_j, t_p)$$

where $\phi(x - x_j \,|\, \sigma/\Delta)$ is the "discrete DAF", $f(x_j, t_p)$ is the digital value of the "signal" at
time $t_p$, and $M$ and $\sigma/\Delta$ will be specified in more detail below. They are adjustable DAF
parameters, and for non-interpolative DAFs, they enable one to vary the behavior of the
above equation all the way from an interpolation limit, where

$$f_{DAF}(x_j, t_p) \equiv f(x_j, t_p)$$

(i.e., the DAF simply reproduces the input data on the grid to as high accuracy as desired)
to the well-tempered limit, where

$$f_{DAF}(x_j, t_p) \neq f(x_j, t_p)$$

for function $f(x, t_p) \in L^2(R)$. Thus the well-tempered DAF does not exactly reproduce the
input data. This price is paid so that instead, a well-tempered DAF approximation makes
the same order error off the grid as it does on the grid (i.e., there are no special points).
We have recently shown that DAFs (both interpolating and non-interpolating) can be
regarded as a set of scaling functionals that can be used to generate extremely robust
wavelets and their associated biorthogonal complements, leading to a full multiresolution
analysis [22, 46, 47, 48, 49, 54, 55, 66, 67]. DAF-wavelets can therefore serve as an
alternative basis for improved performance in signal and image processing.

The DAF wavelet approach can be applied directly to treat bounded domains. As
shown below, the wavelet transform is adaptively adjusted around the boundaries of
finite-length signals by conveniently shifting the modulated window. Thus the
biorthogonal wavelets in the interval are obtained by using a one-sided stencil near the
boundaries. Lagrange interpolation polynomials and band-limited Sinc functionals in
Paley-Wiener space are two commonly used interpolating shells for signal approximation
and smoothing, etc. Because of their importance in numerical analysis, we use these two
kinds of interpolating shells to introduce our discussion. Other modulated windows, such
as the square, triangle, B-spline and Gaussian are under study with regard to the
time-frequency characteristics of generalized interpolating wavelets. By carefully designing
the interpolating Lagrange and Sinc functionals, we can obtain smooth interpolating
scaling functions with an arbitrary order of regularity.

INTERPOLATING WAVELETS 

The basic characteristics of interpolating wavelets of order D discussed in
reference [12] require that the primary scaling function, $\phi$, satisfies the following
conditions.

(1) Interpolation:

$$\phi(k) = \begin{cases} 1, & k = 0 \\ 0, & k \neq 0 \end{cases}, \quad k \in Z \qquad (1)$$

where Z denotes the set of all integers.

(2) Self-Induced Two-Scale Relation: $\phi$ can be represented as a linear
combination of dilates and translates of itself, with a weight given by the
value of $\phi$ at $k/2$.

$$\phi(x) = \sum_k \phi(k/2)\, \phi(2x - k) \qquad (2)$$

This relation is only approximately satisfied for some interpolating wavelets
discussed in the later sections. However, the approximation can be made arbitrarily
accurate.

(3) Polynomial Span: For an integer $D>0$, the collection of formal sums
$\sum_k C_k \phi(x-k)$ contains all polynomials of degree D.

(4) Regularity: For a real $V>0$, $\phi$ is Hölder continuous of order V.

(5) Localization: $\phi$ and all its derivatives through order $\lfloor V \rfloor$ decay rapidly.

$$|\phi^{(r)}(x)| \leq A_s (1+|x|)^{-s}, \quad x \in R, \; s > 0, \; 0 \leq r \leq \lfloor V \rfloor \qquad (3)$$

where $\lfloor V \rfloor$ represents the maximum integer that does not exceed V.

In contrast to most commonly used wavelet transforms, the interpolating wavelet 
transform possesses the following characteristics: 

1. The wavelet transform coefficients are generated by the linear combination of
signal samplings,

$$S_{j,k} = 2^{-j/2} f(2^{-j}k), \qquad W_{j,k} = 2^{-j/2} \left[ f(2^{-j}(k+1/2)) - (P_j f)(2^{-j}(k+1/2)) \right] \qquad (4)$$

instead of the convolution of the commonly used discrete wavelet transform, such as

$$W_{j,k} = \int_R \psi_{j,k}(x)\, f(x)\, dx \qquad (5)$$

where the scaling function is $\phi_{j,k}(x) = 2^{j/2}\phi(2^j x - k)$, the wavelet function is
$\psi_{j,k}(x) = 2^{j/2}\psi(2^j x - k)$, and $P_j f$ is the interpolant $2^{-j/2}\sum_k f(2^{-j}k)\,\phi_{j,k}(x)$.

2. A parallel-computing mode can be easily implemented. The calculation and
compression of coefficients does not depend on the results of other coefficients. For the
halfband filter with length $N$, the calculation of each of the wavelet coefficients, $W_{j,k}$, does
not exceed N+2 multiply/adds.

3. For a D-th order differentiable function, the wavelet coefficients decay rapidly.

4. In a mini-max sense, threshold masking and quantization are nearly optimal
approximations for a wide variety of regularity algorithms.

Theoretically, interpolating wavelets are closely related to the following 
functions: 



Band-limited Shannon wavelets 

The $\pi$ band-limited function, $\phi(x) = \sin(\pi x)/(\pi x) \in C^\infty$ in Paley-Wiener space,
generates the interpolating functions. Every band-limited function $f \in L^2(R)$ can be
reconstructed using the equation

$$f(x) = \sum_k f(k)\, \frac{\sin \pi(x-k)}{\pi(x-k)} \qquad (6)$$

where the related wavelet function, the Sinclet, is defined as (see Figure 1)

$$\psi(x) = \frac{\sin \pi(2x-1) - \sin \pi(x-1/2)}{\pi(x-1/2)} \qquad (7)$$

Interpolating fundamental splines

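The fundamental polynomial spline of degree D, $\eta^D(x)$, where D is an odd integer,
has been shown by Schoenberg (1972) to be an interpolating wavelet (see Figure 2). It
is smooth with order $R=D-1$, and its derivatives through order $D-1$ decay exponentially
[59]. Thus,

$$\eta^D(x) = \sum_k \alpha^D(k)\, \beta^D(x-k) \qquad (8)$$

where $\beta^D(x)$ is the B-spline of order D defined as

$$\beta^D(x) = \sum_{j=0}^{D+1} \frac{(-1)^j}{D!} \binom{D+1}{j} \left( x + \frac{D+1}{2} - j \right)^D U\!\left( x + \frac{D+1}{2} - j \right) \qquad (9)$$

Here U is the step function

$$U(x) = \begin{cases} 0, & x < 0 \\ 1, & x \geq 0 \end{cases} \qquad (10)$$

and $\{\alpha^D(k)\}$ is a sequence that satisfies the infinite summation condition

$$\sum_k \alpha^D(k)\, \beta^D(n-k) = \delta(n) \qquad (11)$$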

Deslauriers-Dubuc functional 

Let D be an odd integer, $D>0$. There exist functions $F_D$ such that if $F_D$ has already
been defined at all binary rationals with denominator $2^j$, it can be extended by polynomial
interpolation to all binary rationals with denominator $2^{j+1}$, i.e., all points halfway between
previously defined points [9, 13]. Specifically, to define the function at $(k+1/2)/2^j$ when
it is already defined at all $\{k/2^j\}$, fit a polynomial $\pi_{j,k}$ to the data $(k'/2^j, F_D(k'/2^j))$ for
$k' \in \{k-(D-1)/2, \ldots, k+(D+1)/2\}$. This polynomial is unique, and

$$F_D\!\left( \frac{k+1/2}{2^j} \right) \equiv \pi_{j,k}\!\left( \frac{k+1/2}{2^j} \right) \qquad (12)$$

This subdivision scheme defines a function which is uniformly continuous at the
rationals and has a unique continuous extension; $F_D$ is a compactly supported interval
polynomial and is regular. It is the auto-correlation function of the Daubechies wavelet
of order $D+1$. It is at least as smooth as the corresponding Daubechies wavelets (roughly
twice as smooth).

Auto-correlation shell of orthonormal wavelets 

If $\check{\phi}$ is an orthonormal scaling function, its auto-correlation $\phi(x) = \check{\phi}(x) * \check{\phi}(-x)$ is an
interpolating wavelet (Figure 3) [40]. Its smoothness, localization and the two-scale
relation are inherited from $\check{\phi}$. The auto-correlations of Haar, Lemarié-Battle, Meyer, and
Daubechies wavelets lead to, respectively, the interpolating Schauder, interpolating
spline, $C^\infty$ interpolating, and Deslauriers-Dubuc wavelets.

Lagrange half-band filters 

Ansari, Guillemot, and Kaiser [1] used Lagrange symmetric halfband FIR filters
to design the orthonormal wavelets that express the relation between the Lagrange
interpolators and Daubechies wavelets [7]. Their filter corresponds to the
Deslauriers-Dubuc wavelet of order $D=7$ $(2M-1)$, $M=4$. The transfer function of the halfband
symmetric filter h is given by

$$H(z) = \frac{1}{2} + z\,T(z^2) \qquad (13)$$

where T is the trigonometric polynomial. Except for $h(0)=1/2$, at every even integer
lattice point $h(2n)=0$, $n \neq 0$, $n \in Z$. The transfer function of the symmetric FIR filter
$h(n)=h(-n)$ has the form

$$H(z) = \frac{1}{2} + \sum_{n=1}^{\infty} h(2n-1)\left( z^{1-2n} + z^{2n-1} \right) \qquad (14)$$

The concept of an interpolating wavelet decomposition is similar to the "algorithme
à trous", the connection having been found by Shensa [42]. The self-induced scaling and
interpolation conditions are the most important characteristics of interpolating wavelets.
From the following equation

$$f(x) = \sum_n f(n)\, \phi(x-n) \qquad (15)$$

and Eq. (1), the approximation to the signal is exact on the discrete sampling points, which
does not hold in general for commonly used non-interpolating wavelets.



GENERALIZED INTERPOLATING WAVELETS

Interpolating wavelets with either a Lagrange polynomial shell or Sinc functional
shell are discussed in detail. We call these kinds of window-modulated wavelets
generalized interpolating wavelets, because they are more convenient to construct,
process and extend to higher dimensional spaces.

Generalized Lagrange Wavelets

Three kinds of interpolating Lagrange wavelets, Halfband Lagrange wavelets,
B-spline Lagrange wavelets and Gaussian-Lagrange DAF wavelets, are studied here as
examples of the generalized interpolating wavelets.

Halfband Lagrange wavelets can be regarded as extensions of the Dubuc
interpolating functional [9, 13], the auto-correlation shell wavelet analysis [40], and
halfband filters [1]. B-spline Lagrange Wavelets are generated by a B-spline-windowed
Lagrange functional which increases the smoothness and localization properties of the
simple Lagrange scaling function and its related wavelets. Lagrange Distributed
Approximating Functionals (LDAF), i.e., Gaussian-modulated Lagrange polynomials, have
been successfully applied for numerically solving various linear and nonlinear partial
differential equations. Typical examples include DAF-simulations of 3-dimensional
reactive quantum scattering and the solution of a 2-dimensional Navier-Stokes equation
with non-periodic boundary conditions. In terms of a wavelet analysis, DAFs can be
regarded as particular scaling functions (wavelet-DAFs) and the associated
DAF-wavelets can be generated in a number of ways [20, 21, 22, 65, 66, 67].

Halfband Lagrange Wavelets 
A special case of halfband filters can be obtained by choosing the filter
coefficients according to the Lagrange interpolation formula. The filter coefficients
are given by

$$h(2n-1) = \frac{(-1)^{n+M-1} \prod_{m=1}^{2M} (M + 1/2 - m)}{(M-n)!\,(M+n-1)!\,(2n-1)} \qquad (16)$$

These filters have the property of maximal flatness in Fourier space, possessing a
balance between the degree of flatness at zero frequency and the flatness at the
Nyquist frequency (half sampling).
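
The closed-form coefficients can be checked numerically. The sketch below assumes the Ansari-Guillemot-Kaiser form $h(2n-1) = (-1)^{n+M-1} \prod_{m=1}^{2M}(M+1/2-m) \,/\, [(M-n)!\,(M+n-1)!\,(2n-1)]$ with $h(0)=1/2$; the function names are hypothetical, and exact rational arithmetic is used so that no rounding enters.

```python
from fractions import Fraction
from math import factorial


def lagrange_halfband_tap(n, M):
    """h(2n-1) for n = 1..M, from the assumed closed-form expression."""
    prod = Fraction(1)
    for m in range(1, 2 * M + 1):
        prod *= Fraction(2 * M + 1 - 2 * m, 2)  # the factor (M + 1/2 - m)
    sign = -1 if (n + M - 1) % 2 else 1
    denom = factorial(M - n) * factorial(M + n - 1) * (2 * n - 1)
    return sign * prod / denom


def lagrange_halfband_filter(M):
    """Symmetric halfband filter as a dict {lag: tap}; even lags other than 0 vanish."""
    taps = {0: Fraction(1, 2)}
    for n in range(1, M + 1):
        h = lagrange_halfband_tap(n, M)
        taps[2 * n - 1] = h
        taps[-(2 * n - 1)] = h
    return taps


taps = lagrange_halfband_filter(2)
dc_gain = sum(taps.values())  # response at zero frequency
```

Under this normalization the taps sum to one, which is the unit response at zero frequency expected of a halfband low-pass filter.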

These half-band filters can be utilized to generate the interpolating wavelet
decomposition, which can be regarded as a class of the auto-correlated shell of
orthogonal wavelets, such as the Daubechies wavelets [7]. The interpolating wavelet
transform can also be extended to higher order cases using different Lagrange
polynomials, as [40]

$$P_{2n-1}(x) = \prod_{m=-M+1,\, m \neq n}^{M} \frac{x - (2m-1)}{(2n-1) - (2m-1)} \qquad (17)$$

The predictive interpolation can be expressed as

$$\Gamma S_j(i) = \sum_{n=1}^{M} P_{2n-1}(0) \left[ S_j(i+2n-1) + S_j(i-2n+1) \right] \qquad (18)$$

where $\Gamma$ is a projection and $S_j$ denotes the $j$th layer low-pass coefficients. This projection
relation is equivalent to the subband filter response of

$$h(2n-1) = P_{2n-1}(0) \qquad (19)$$

The above-mentioned interpolating wavelets can be regarded as the extension of the
fundamental Deslauriers-Dubuc iterative subdivision scheme, which results when
$M=2$. The order of the Lagrange polynomial is $D=2M-1=3$ (Figure 6(a)).
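
The predictive interpolation for $M=2$ can be illustrated with a short sketch. It assumes the prediction weights are Lagrange basis polynomials on the odd nodes $2m-1$, $m=-M+1,\ldots,M$, evaluated at 0, and that the sample being predicted sits at index `i` of a list whose neighbours at odd offsets are known; the function names are hypothetical.

```python
from fractions import Fraction


def prediction_weight(n, M):
    """Lagrange basis polynomial for node 2n-1 (nodes 2m-1, m=-M+1..M) evaluated at 0."""
    xn = 2 * n - 1
    w = Fraction(1)
    for m in range(-M + 1, M + 1):
        if m == n:
            continue
        xm = 2 * m - 1
        w *= Fraction(0 - xm, xn - xm)
    return w


def predict_sample(samples, i, M):
    """Predict samples[i] from its neighbours at offsets +/-(2n-1), n = 1..M."""
    return sum(
        prediction_weight(n, M) * (samples[i + 2 * n - 1] + samples[i - 2 * n + 1])
        for n in range(1, M + 1)
    )


# A cubic is reproduced exactly by the M=2 (cubic Lagrange) predictor,
# so the detail (sample minus prediction) vanishes.
cubic = [x ** 3 - 2 * x + 1 for x in range(11)]
detail = cubic[5] - predict_sample(cubic, 5, 2)
```

For smooth data the detail coefficients are therefore small, which is what makes the interpolating transform useful for compression.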

It is easy to show that an increase of the Lagrange polynomial order D will
introduce higher regularity for the interpolating functionals (Figure 7(a)). When $D \to +\infty$,
the interpolating functional tends to a band-limited Sinc function and its domain of
definition is on the real line. The subband filters generated by Lagrange interpolating
functionals satisfy the properties:

(1) Interpolation: $h(\omega) + h(\omega + \pi) = 1$

(2) Symmetry: $h(\omega) = h(-\omega)$

(3) Vanishing Moments: $\int_R x^p \phi(x)\, dx = \delta_p$

Donoho outlines a basic subband extension to obtain a perfect reconstruction. He
defines the wavelet function as

$$\psi(x) = \phi(2x - 1) \qquad (20)$$

The biorthogonal subband filters can be expressed as

$$\tilde{h}(\omega) = 1, \quad g(\omega) = e^{-i\omega}, \quad \tilde{g}(\omega) = e^{-i\omega}\, h(\omega + \pi) \qquad (21)$$

However, the Donoho interpolating wavelets have some drawbacks. Because the
low-pass coefficients are generated by a sampling operation only, as the decomposition layer
increases, the correlation between low-pass coefficients becomes weaker. The
interpolating (prediction) error (high-pass coefficients) strongly increases, which is
deleterious to the efficient representation of the signal. Further, it cannot be used to
generate a Riesz basis for $L^2(R)$ space.

Sweldens has provided an efficient and robust scheme [56] for constructing
biorthogonal wavelet filters. His approach can be utilized to generate high-order
interpolating Lagrange wavelets with higher regularity. As Figure 4 shows, $P_0$ is the
interpolating prediction process, and the $P_1$ filter is called the updating filter, used to
smooth the down-sampling low-pass coefficients. If we choose $P_0$ to be the same as $P_1$,
then the new interpolating subband filters can be depicted as

$$\tilde{h}_1(\omega) = 1 + \tilde{g}(\omega)\, P(2\omega)$$
$$g_1(\omega) = e^{-i\omega} - h(\omega)\, P(2\omega) \qquad (22)$$

The newly developed filters $h_1$, $g_1$, $\tilde{h}_1$, and $\tilde{g}_1$ also generate the biorthogonal dual pair
for a perfect reconstruction. Examples of biorthogonal lifting wavelets with regularity
$D=3$ are shown in Figure 5. Figure 6 gives the corresponding Fourier responses of the
equivalent subband decomposition filters.
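
The predict/update structure of lifting, and the reason it always gives perfect reconstruction, can be seen in a toy example. The sketch below is an illustration only: it assumes a two-point (linear) predictor, a quarter-weight update and periodic boundaries, which are simpler than the Lagrange filters discussed here, and the function names are hypothetical.

```python
from fractions import Fraction


def lifting_forward(signal):
    """One lifting level: split, predict (detail), update (smoothed low-pass)."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    even = [Fraction(v) for v in signal[0::2]]
    odd = [Fraction(v) for v in signal[1::2]]
    half = len(even)
    # Predict each odd sample from its two even neighbours (periodic wrap).
    detail = [odd[k] - (even[k] + even[(k + 1) % half]) / 2 for k in range(half)]
    # Update the even samples with the neighbouring details to smooth them.
    smooth = [even[k] + (detail[(k - 1) % half] + detail[k]) / 4 for k in range(half)]
    return smooth, detail


def lifting_inverse(smooth, detail):
    """Undo the update, then the prediction, then merge the two phases."""
    half = len(smooth)
    even = [smooth[k] - (detail[(k - 1) % half] + detail[k]) / 4 for k in range(half)]
    odd = [detail[k] + (even[k] + even[(k + 1) % half]) / 2 for k in range(half)]
    merged = []
    for e, o in zip(even, odd):
        merged.extend([e, o])
    return merged


data = [3, 1, 4, 1, 5, 9, 2, 6]
low, high = lifting_forward(data)
restored = lifting_inverse(low, high)
```

Because each step only adds a function of the other channel, the inverse simply subtracts the same quantity in reverse order, whatever predictor or updating filter is chosen.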

B-Spline Lagrange Wavelets 

Lagrange polynomials are natural interpolating expressions for functional
approximations. Utilizing a different expression for the Lagrange polynomials, we can
construct other forms of useful interpolating wavelets as follows. We define a class of
symmetric Lagrange interpolating functional shells as

$$P_M(x) = \prod_{i=-M,\, i \neq 0}^{M} \frac{x - i}{-i} \qquad (23)$$

It is easy to verify that this Lagrange shell also satisfies the interpolating condition on
discrete, integer points,

$$P_M(k) = \begin{cases} 1, & k = 0 \\ 0, & \text{otherwise} \end{cases} \qquad (24)$$

However, simply defining the filter response as

$$h(k) = P_M(k/2)/2, \quad k = -M, \ldots, M \qquad (25)$$

leads to non-stable interpolating wavelets, as shown in Figure 7.

Including a smooth window, which vanishes at the zeros of the Lagrange
polynomial, will lead to more regular interpolating wavelets and equivalent subband
filters (as shown in Figures 7 and 8). We select a well-defined B-spline function as the
weight window. Then the scaling function (mother wavelet) can be defined as an
interpolating B-Spline Lagrange functional (BSLF)

$$\phi_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)}\, P_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)} \prod_{i=-M,\, i \neq 0}^{M} \frac{x - i}{-i} \qquad (26)$$

where N is the B-spline order and $\eta$ is the scaling factor to control the window width.
To ensure coincidence of the zeroes of the B-spline and the Lagrange polynomial, we set

$$2M = \eta \times (N+1) \qquad (27)$$

To ensure the interpolation condition, the B-spline envelope degree M must be an odd
number. It is easy to show that if the B-spline order is $N=4k+1$, $\eta$ can be any odd integer
$(2k+1)$; if N is an even integer, then $\eta$ can only be 2. When $N=4k-1$, we cannot construct
an interpolating shell using the definition above. From the interpolation and self-induced
scaling properties of the interpolating wavelets, it is easy to verify that

$$h(k) = \phi_M(k/2)/2, \quad k = -2M+1, \ldots, 2M-1 \qquad (28)$$

Gaussian-Lagrange DAF Wavelets 

We can also select a class of distributed approximating functionals, the Gaussian-Lagrange DAFs (GLDAF), as our basic scaling function to construct interpolating wavelets:

$$\phi_M(x) = W_\sigma(x) P_M(x) = W_\sigma(x) \prod_{i=-M,\ i\neq 0}^{M} \frac{x-i}{-i} \qquad (29)$$

where $W_\sigma(x)$ is a window function. It is chosen to be a Gaussian,

$$W_\sigma(x) = e^{-x^2/(2\sigma^2)} \qquad (30)$$

because it satisfies the minimum frame bound condition in quantum physics. Here $\sigma$ is a window width parameter, and $P_M(x)$ is the Lagrange interpolation kernel. The DAF scaling function has been successfully introduced as an efficient and powerful grid method for quantum dynamical propagations [40]. Using Sweldens' lifting scheme [32], a wavelet basis is generated. The Gaussian window in our DAF-wavelets efficiently smooths out the Gibbs oscillations, which plague most conventional wavelet bases. The following equation shows the connection between the B-spline window function and the Gaussian window [34]:



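$$\beta^N(x) \cong \sqrt{\frac{6}{\pi(N+1)}} \exp\left(\frac{-6x^2}{N+1}\right) \qquad (31)$$

for large N. As in Figure 12, if we choose the window width

$$\sigma = \eta\sqrt{(N+1)/12} \qquad (32)$$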

the Gaussian-Lagrange wavelets generated by the lifting scheme will be similar to the B-spline Lagrange wavelets. Usually, the Gaussian-Lagrange DAF displays slightly better smoothness and more rapid decay than the B-spline Lagrange wavelets. If we select more sophisticated window shapes, such as those popular in engineering (Bartlett, Hanning, Hamming, Blackman, Chebyshev, and Bessel windows), the Lagrange wavelets can be generalized further. We shall call these extensions Bell-windowed Lagrange wavelets.

Generalized Sinc Wavelets

As we have mentioned above, the $\pi$ band-limited Sinc function,

$$\phi(x) = \frac{\sin(\pi x)}{\pi x} \qquad (33)$$

in Paley-Wiener space, constructs an interpolating function. Every $\pi$ band-limited function $f \in L^2(R)$ can be reconstructed by the equation

$$f(x) = \sum_k f(k) \frac{\sin \pi(x-k)}{\pi(x-k)} \qquad (34)$$

where the related wavelet function, the Sinclet, is defined as (see Figure 1)

$$\psi(x) = \frac{\sin \pi(2x-1) - \sin \pi(x-1/2)}{\pi(x-1/2)} \qquad (35)$$

The scaling Sinc function is the well-known ideal low-pass filter, which possesses the ideal square filter response

$$H(\omega) = \begin{cases} 1, & |\omega| \le \pi/2 \\ 0, & \pi/2 < |\omega| \le \pi \end{cases} \qquad (36)$$

Its impulse response can be generated as

$$h[k] = \frac{1}{2\pi} \int_{-\pi/2}^{\pi/2} e^{jk\omega}\, d\omega = \frac{\sin(\pi k/2)}{\pi k} \qquad (37)$$

The so-called half-band filter possesses a nonzero impulse response only at the odd integer samples, $h(2k+1)$, while at even integers $h[2k] = 0$ unless $k = 0$.
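
The half-band property of Eq. (37) can be checked numerically with a short sketch. The function name `ideal_halfband` and the truncation length K = 8 are assumptions made for illustration only.

```python
import math

def ideal_halfband(k):
    """h[k] = sin(pi*k/2)/(pi*k), with the k = 0 limit equal to 1/2 (Eq. 37)."""
    if k == 0:
        return 0.5
    return math.sin(math.pi * k / 2.0) / (math.pi * k)

K = 8  # truncation half-length, chosen only for illustration
taps = {k: ideal_halfband(k) for k in range(-K, K + 1)}

# Even-indexed taps (other than k = 0) vanish up to floating-point round-off.
even_max = max(abs(v) for k, v in taps.items() if k % 2 == 0 and k != 0)
print("largest even tap:", even_max)
print("h[1] =", taps[1], " h[3] =", taps[3])
```

The odd taps decay only as 1/k, which is why truncating this response to a finite length produces the overshoot discussed next.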

However, this ideal low-pass filter is never used in applications. Since the digital filter is an IIR (infinite impulse response) solution, truncating it to a digital cutoff FIR (finite impulse response) filter produces the Gibbs phenomenon (overshoot effect) in Fourier space, which degrades the frequency resolution (Figure 11). The resulting compactly supported Sinc scaling and wavelet functions, as well as their biorthogonal dual scaling and wavelet functions, are shown in Figure 12. We see that the regularity of the cutoff Sinc is obviously degraded with a fractal-like shape, which leads to poor time localization.

B-Spline Sinc Wavelets

Because the ideal low-pass Sinc wavelet cannot be implemented "ideally" by FIR (finite impulse response) filters, a windowed weighting technique is employed to eliminate the cutoff singularity and adjust the time-frequency localization of the Sinc wavelet analysis. To begin, we define a symmetric Sinc interpolating functional shell as

$$P(x) = \frac{\sin(\pi x/2)}{\pi x} \qquad (38)$$



Utilizing a smooth window, which vanishes gradually at the exact zeros of the Sinc functional, will lead to more regular interpolating wavelets and equivalent subband filters (as shown in Figures 13 and 14). For example, we illustrate using a well-defined B-spline function as the weight window. Then the scaling function (mother wavelet) can be defined as an interpolating B-spline Sinc functional (BSF)

$$\phi_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)} P(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)} \frac{\sin(\pi x/2)}{\pi x} \qquad (39)$$

where $N$ is the B-spline order and $\eta$ is the scaling factor that controls the window width. To ensure the coincidence of the zeros of the B-spline and the Sinc shell, we set

$$2M+1 = \eta \times (N+1)/2 \qquad (40)$$

To maintain the interpolation condition, $h(2k) = 0$, $k \neq 0$, it is easy to show that when the B-spline order $N = 4k+1$, $\eta$ may be any odd integer $(2k+1)$. If $N$ is an even integer, then $\eta$ can only be 2. When $N = 4k-1$, we cannot construct an interpolating shell using the definition above. The admissibility condition can be expressed as

$$\begin{cases} \eta = 2, & N = 2i \\ \eta = 2k+1, & N = 4i+1 \end{cases} \qquad (41)$$

From the interpolation relation

$$\phi_M(k) = \begin{cases} 1, & k = 0 \\ 0, & k \neq 0 \end{cases} \qquad (42)$$

and the self-induced two-scale relation

$$\phi_M(x) = \sum_k \phi_M(k/2)\, \phi_M(2x-k) \qquad (43)$$

it is easy to show that

$$h(k) = \phi_M(k/2)/2, \quad k = -2M+1, \ldots, 2M-1 \qquad (44)$$

Gaussian-Sinc DAF Wavelets

We can also select a class of distributed approximating functionals, i.e., the Gaussian-Sinc DAF (GSDAF), as our basic scaling function to construct interpolating scalings,

$$\phi_M(x) = W_\sigma(x) P(x) = W_\sigma(x) \frac{\sin(\pi x/2)}{\pi x} \qquad (45)$$

where $W_\sigma(x)$ is a window function which is selected as a Gaussian,

$$W_\sigma(x) = e^{-x^2/(2\sigma^2)} \qquad (46)$$



WO 99/46731 



PCT/US99/05426 






Because it satisfies the minimum frame bound condition in quantum physics, it significantly improves the time-frequency resolution of the Windowed-Sinc wavelet. Here $\sigma$ is a window width parameter, and $P(x)$ is the Sinc interpolation kernel. This DAF scaling function has been successfully used in an efficient and powerful grid method for quantum dynamical propagations [40]. Moreover, the Hermite DAF is known to be extremely accurate for solving the 2-D harmonic oscillator, for calculating the eigenfunctions and eigenvalues of the Schrödinger equation. The Gaussian window in our DAF-wavelets efficiently smooths out the Gibbs oscillations, which plague most conventional wavelet bases. The following equation shows the connection between the B-spline and the Gaussian windows [34]:

$$\beta^N(x) \cong \sqrt{\frac{6}{\pi(N+1)}} \exp\left(\frac{-6x^2}{N+1}\right) \qquad (47)$$

for large N. As in Figure 6, if we choose the window width

$$\sigma = \eta\sqrt{(N+1)/12} \qquad (48)$$

the Gaussian Sinc wavelets generated by the lifting scheme will be similar to the B-spline Sinc wavelets. Usually, the Gaussian Sinc DAF displays slightly better smoothness and more rapid decay than the B-spline Lagrange wavelets. If we select more sophisticated window shapes, the Sinc wavelets can be generalized further. We call these extensions Bell-windowed Sinc wavelets. The available choices can be any of the popularly used DFT (discrete Fourier transform) windows, such as Bartlett, Hanning, Hamming, Blackman, Chebyshev, and Bessel windows.

ADAPTIVE BOUNDARY ADJUSTMENT

The above mentioned generalized interpolating wavelet is defined on the domain C(R). Many engineering applications involve finite length signals, such as images and isolated speech segments. In general, we can define these signals on C[0,1]. One could set the signal equal to zero outside [0,1], but this introduces an artificial "jump" discontinuity at the boundaries, which is reflected in the wavelet coefficients and degrades signal filtering and compression in multiscale space. Developing wavelets adapted to "life on an interval" is therefore useful. Periodization and symmetric periodization are two commonly used methods to reduce the effect of edges. However, unless the finite length signal has a large flat area around the boundaries, these two methods cannot remove the discontinuous effects completely [4,6,11].

Dubuc utilized an iterative interpolating function, $F_D$, on the finite interval to generate an interpolation on the set of dyadic rationals $D_j$. The interpolation in the neighborhood of the boundaries is treated using a boundary-adjusted functional, which leads to regularity of the same order as in the interval. This avoids the discontinuity that results from periodization or extending by zero. It is well known that this results in weaker edge effects, and that no extra wavelet coefficients (to deal with the boundary) have to be introduced, provided the filters used are symmetric.

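We let $K_j$ represent the number of coefficients at resolution layer $j$, where $K_j = 2^j$. Let $2^j > 2D+2$ define the non-interacting decomposition. If we let $j_0$ hold the non-interaction case $2^{j_0} > 2D+2$, then there exist functions $\phi^{Interval}_{j,k}$, $\psi^{Interval}_{j,k}$ such that every $f \in C[0,1]$ can be expanded in the $\phi^{Interval}_{j_0,k}$ and the $\psi^{Interval}_{j,k}$, $j \ge j_0$ (49).

The $\phi^{Interval}_{j,k}$, $\psi^{Interval}_{j,k}$ are called the interval interpolating scalings and wavelets, which satisfy the interpolation conditions

$$\phi^{Interval}_{j_0,k}(2^{-j_0} n) = \delta_{k,n},\ 0 \le n < K_{j_0}; \qquad \psi^{Interval}_{j,k}(2^{-j-1} n) = \delta_{2k+1,n},\ 0 \le n < K_{j+1} \qquad (50)$$

The interval scaling is defined as

$$\phi^{Interval}_{j,k} = \begin{cases} \phi^{Left}_{j,k}, & \text{near the left boundary} \\ \phi_{j,k}|_{[0,1]}, & \text{in the interior} \\ \phi^{Right}_{j,k}, & \text{near the right boundary} \end{cases} \qquad (51)$$

where $\phi_{j,k}|_{[0,1]}$ is called the "inner-scaling" which is identical to the fundamental interpolating function, and $\phi^{Left}_{j,k}$ and $\phi^{Right}_{j,k}$ are the "left-boundary" and the "right-boundary" scalings, respectively. Both are as smooth as $\phi_{j,k}|_{[0,1]}$. Interval wavelets are defined as

$$\psi^{Interval}_{j,k} = \begin{cases} \psi^{Left}_{j,k}, & 0 \le k < \lfloor D/2 \rfloor \\ \psi_{j,k}|_{[0,1]}, & \lfloor D/2 \rfloor \le k < 2^j - \lfloor D/2 \rfloor \\ \psi^{Right}_{j,k}, & 2^j - \lfloor D/2 \rfloor \le k < 2^j \end{cases} \qquad (52)$$

where $\psi_{j,k}|_{[0,1]}$ is the inner-wavelet, and $\psi^{Left}_{j,k}$ and $\psi^{Right}_{j,k}$ are the left and right-boundary wavelets, respectively, which are of the same order regularity as the inner-wavelet [13].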
The corresponding factors for the Deslauriers-Dubuc interpolating wavelets are $M=2$, and the order of the Lagrange polynomial is $D = 2M-1 = 3$. The interpolating wavelet transform can be extended to high order cases by two kinds of Lagrange polynomials, where the inner-polynomials are defined as [14]

$$P_{2n-1}(x) = \prod_{m=-M+1,\ m\neq n}^{M} \frac{x-(2m-1)}{(2n-1)-(2m-1)} \qquad (53)$$

This kind of polynomial introduces the interpolation in the intervals according to

$$P_j S(i) = \sum_{n=1}^{M} P_{2n-1}(0)\,[S_j(i+2n-1) + S_j(i-2n+1)], \quad i = 2k+1 \qquad (54)$$

and the boundary polynomials are

$$L_d(x) = \prod_{m=0,\ m\neq d}^{D} \frac{x-m}{d-m}, \quad 0 \le d \le D \qquad (55)$$

which introduce the adjusted interpolation on the two boundaries of the intervals. That is,

$$P_j S(i) = \sum_{d=0}^{D} L_d(i/2)\, S_j(i+2d-1), \quad i = 2k+1,\ 0 \le k \le \lfloor (D-1)/2 \rfloor \qquad (56)$$

The left boundary extrapolation outside the intervals is defined as

$$P_j S(-1) = \sum_{d=0}^{D} L_d(-1/2)\, S_j(2d), \quad i = 2k+1,\ 0 \le k \le \lfloor (D-1)/2 \rfloor \qquad (57)$$

and the right boundary extrapolation is similar to the above. The boundary-adjusted interpolating scaling is shown in Figure 16.

Although Dubuc shows the interpolation is almost twice differentiable, there still is a discontinuity in the derivative. In this paper, a DAF-wavelet based boundary-adjusted algorithm is introduced. This technique can produce an arbitrarily smooth derivative approximation, because of the infinitely differentiable character of the Gaussian envelope. The boundary-adjusted scaling functionals are generated conveniently, just by window shifting, and satisfy the following equation:

$$\phi_m(x) = W(x-2m) P(x), \quad m = -\lfloor (M-1)/2 \rfloor, \ldots, \lfloor (M-1)/2 \rfloor \qquad (58)$$

where $\phi_m(x)$ represents the different boundary scalings, $W(x)$ is the generalized window function and $P(x)$ is the symmetric interpolating shell. When $m>0$, left boundary functionals are generated; when $m<0$, we obtain right boundary functionals. The case $m=0$ represents the inner scalings mentioned above.
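
The window-shifting rule of Eq. (58) can be sketched as below. This is an illustration under stated assumptions rather than the authors' code: a Gaussian is used for the generalized window W, the Lagrange shell of Eq. (23) is used for P, and the values M = 5 and sigma = 2.5 are hypothetical. The sketch implements Eq. (58) literally and applies no further normalization.

```python
import math

def lagrange_shell(x, M):
    """Symmetric Lagrange shell P_M(x) of Eq. (23)."""
    p = 1.0
    for i in range(-M, M + 1):
        if i != 0:
            p *= (x - i) / (-i)
    return p

def boundary_scaling(x, m, M, sigma):
    """phi_m(x) = W(x - 2m) P(x) of Eq. (58), with a Gaussian W assumed."""
    window = math.exp(-((x - 2.0 * m) ** 2) / (2.0 * sigma ** 2))
    return window * lagrange_shell(x, M)

M, sigma = 5, 2.5  # illustrative values only
shifts = list(range(-((M - 1) // 2), (M - 1) // 2 + 1))
for m in shifts:
    # Shifting the window leaves the zeros of the shell at the nonzero nodes intact.
    assert all(boundary_scaling(k, m, M, sigma) == 0.0
               for k in range(-M, M + 1) if k != 0)
print(shifts)
print(boundary_scaling(0.0, 0, M, sigma))
```

Only the envelope moves with m; the interpolation nodes stay fixed, which is why no separate boundary polynomial has to be constructed.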

One example for a Sinc-DAF wavelet is shown in Figure 17. We choose the compactly-supported length of the scaling function to be the same as the halfband Lagrange wavelet. It is easy to show that our newly developed boundary scaling is smoother than the commonly used Dubuc boundary interpolating functional. Thus it will generate a more stable boundary-adjusted representation for finite-length wavelet transforms, as well as a better derivative approximation around the boundaries. Figure 18 is the boundary filter response comparison between the halfband Lagrange wavelet and our DAF wavelet. It is easy to establish that our boundary response decreases the overshoot of the low-pass band filter, and so is more stable for boundary frequency analysis and approximation.

APPLICATIONS OF GSIWs 

Eigenvalue Solution of 2D Quantum Harmonic Oscillator 
As discussed in [22], a standard eigenvalue problem of the Schrödinger equation is that of the 2D harmonic oscillator,

$$\left[-\frac{1}{2}\sum_{i=1}^{2}\frac{\partial^2}{\partial x_i^2} + \frac{1}{2}(x_1^2 + x_2^2)\right]\Phi_k(x_1,x_2) = E_k \Phi_k(x_1,x_2) \qquad (59)$$

Here $\Phi_k$ and $E_k$ are the $k$th eigenfunction and eigenvalue respectively. The eigenvalues are given exactly by

$$E_{k_1,k_2} = 1 + k_1 + k_2, \quad 0 \le k_1 < \infty,\ 0 \le k_2 < \infty \qquad (60)$$

with a degeneracy ($k_d = k+1$) in each energy level $E_k = 1+k$. The 2D version of the wavelet DAF representation of the Hamiltonian operator was constructed and the first 21 eigenvalues and eigenfunctions obtained by subsequent numerical diagonalization of the discrete Sinc-DAF Hamiltonian. As shown in Table 1, all results are accurate to at least 10 significant figures for the first 16 eigenstates. It is evident that DAF-wavelets are powerful for solving eigenvalue problems.

Table 1

Eigenvalues of the 2D harmonic oscillator

k = k_x + k_y    k_d    Exact Solution    Sinc-DAF Calculation
0                1      1                 0.99999999999835
1                2      2                 1.99999999999952
                                          1.99999999999965
2                3      3                 2.99999999999896
                                          2.99999999999838
                                          2.99999999999997
3                4      4                 3.99999999999943
                                          3.99999999999947
                                          3.99999999999986
                                          3.99999999999994
4                5      5                 4.99999999999907
                                          4.99999999999953
                                          4.99999999999989
                                          5.00000000000674
                                          5.00000000000813
5                6      6                 5.99999999999982
                                          6.00000000000018
                                          6.00000000000752
                                          6.00000000000801
                                          6.00000000011972
                                          6.00000000012005
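
The exact column of the table follows directly from Eq. (60). The short enumeration below, with the hypothetical helper name `exact_levels`, lists the levels for k = 0 to 5 and confirms that they account for the 21 states quoted in the text, with degeneracy k + 1 per level. It reproduces only the exact values, not the Sinc-DAF diagonalization.

```python
from collections import Counter

def exact_levels(k_max):
    """Exact eigenvalues E = 1 + k1 + k2 (Eq. 60) for all k1 + k2 <= k_max."""
    energies = [1 + k1 + k2
                for k1 in range(k_max + 1)
                for k2 in range(k_max + 1 - k1)]
    return sorted(energies)

levels = exact_levels(5)
degeneracy = Counter(levels)
print(len(levels))       # 21 states for k = 0..5
print(dict(degeneracy))  # {1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6}
```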



Target Extraction 

Military target extraction, including such applications as high-resolution radar (aerial photograph) imaging, radar echo, and remote sensing detection, is a difficult subject. National defense agencies (e.g. the Navy, Army, and Air Force) have great interest in technical advances for reconnaissance, early warning and target recognition. Some of our DAF-wavelet schemes represent a significant breakthrough for these very difficult tasks. Compared with other methods, our algorithm possesses a very high resolution for target localization and high efficiency for clutter/noise suppression, as well as computational efficiency.

Detecting a military target in a low luminance environment is challenging work for image processing. To improve the target discrimination, the visibility of differences between a pair of images is important for modern image restoration and enhancement. We construct a method for detectability using a multiple channel enhancement technique. The images were captured on a color monitor at a viewing distance giving 95 pixels per degree of visual angle and an image size of 5.33 × 5.05 deg. The mean luminance of the images was about 10 cd/m² [39]. Using our newly developed visual enhancement techniques, visual targets can be extracted very accurately in a low-luminance environment for detection and warning. The technique combines the response of the human vision system (HVS) with multiresolution enhancement and restoration methods. The simulation of tank-target detection in a low-luminance environment is shown in Figure 19.

Image Filtering 

Image de-noising is a difficult problem for signal processing. Due to the 
complicated structure of image and background noise, an optimal filtering technique 
does not currently exist. Generally, the possible noise sources include photoelectric 
exchange, photo spots, image communication error, etc. Such noise causes the visual 
perception to generate speckles, blips, ripples, bumps, ringing and aliasing. The noise 
distortion not only affects the visual quality of images, but also degrades the 
efficiency of data compression and coding. De-noising and smoothing are extremely 
important for image processing. 

We use a DAF-wavelet to generate a more efficient, human-vision-system-based image processing technique, which possesses the advantages of 1) long range de-correlation for convenience of compression and filtering; 2) high perceptual sensitivity and robustness; 3) filtering that takes account of the human visual response, so that it can enhance the most important visual information, such as edges, while suppressing large-scale flat regions and background; 4) real-time processing.

Biorthogonal interpolating wavelets and corresponding filters are constructed based on Gauss-Lagrange distributed approximating functionals (DAFs). The utility of these DAF wavelets and filters is tested for digital image de-noising in combination with a novel blind restoration technique. This takes account of the response of the human vision system so as to remove the perceptual redundancy and obtain better visual performance in image processing. The test results for a color photo are shown in Figure 20. It is evident that our Color Visual Group Normalization technique yields excellent contrast and edge-preservation and provides a natural color result for the restored image [48].

Imaging Enhancement 

Mammograms are complex in appearance and signs of early disease are often 
small and/or subtle. Digital mammogram image enhancement is particularly important 
for solving storage and logistics problems, and for the possible development of an 
automated-detection expert system. The DAF-wavelet based mammogram enhancement is implemented in the following manner. First we generate a perceptual lossless quantization matrix $Q_{j,m}$ to adjust the original transform coefficients $C_{j,m}(k)$. This treatment provides a simple human-vision-based threshold technique for the restoration of the most important perceptual information in an image. For grayscale image contrast stretching, we appropriately normalize the decomposition coefficients according to the length scale, $L$, of the display device [16] so that they fall in the interval [0,1] of the device frame

$$NC_{j,m}(k) = Q_{j,m} C_{j,m}(k)/L \qquad (61)$$

We then use a nonlinear mapping to obtain the desired contrast stretching

$$\overline{NC}_{j,m} = \gamma_{j,m} X_{j,m}(NC_{j,m}) \qquad (62)$$

where the constant $\gamma_{j,m}$ and function $X_{j,m}$ are appropriately chosen so that the desired portion of the grayscale gradient is stretched or compressed.

To test our new approach, low-contrast and low quality breast mammogram images are employed. A typical low quality front-view image is shown in Figure 21(a). The original image is coded at 512×512 pixel size with 2 bytes/pixel and 12 bits of gray scale. We have applied our edge enhancement normalization and device-adapted visual group normalization. As shown in Figures 21(b) and 21(c), there is a significant improvement in both the edge representation and image contrast. In particular, the domain and internal structure of high-density cancer tissues are more clearly displayed. Figure 22(a) is an original 1024×1024 side-view breast image which has been digitized to a 200 micron pixel edge with 8 bits of gray scale. The enhanced image result is shown in Figure 22(b). In this case we again obtain a significant improvement in image quality as described herein.

CONCLUSION 

In summary, a new class of wavelets, generalized symmetric interpolating wavelets, was described; these are generated by a window-modulated interpolating shell. Due to the absence of a complicated factorization process, this kind of interpolating wavelet is easily implemented and possesses very good characteristics in both time (space) and spectral domains. The stable boundary adjustment can be generated by a window shifting operation only. It overcomes the overshoot of the boundary response introduced by other boundary processing, such as Dubuc Lagrange wavelet and Daubechies boundary filters. Many successful applications of DAF-wavelets have been reported to illustrate their practicality and mathematical behavior.

DUAL WINDOW SELECTIVE AVERAGING FILTER
INTRODUCTION 

Local averaging is one of the simplest filtering techniques. It preserves the 
mean gray level while suppressing the variability in flat regions. However, simple 
linear local averaging is undesirable for image smoothing because it is incapable of 
preserving image edges. Linear local averaging filters are essentially low pass filters, 
which tend to blur the edges and fine structures in the original image. In order to 
preserve the edges while achieving some degree of smoothing, it is desirable to 
employ nonlinear filters. A number of nonlinear selective averaging methods [1-8] 
have been devised for this purpose. The basic idea for these methods is to select only 
a portion of the gray level values in the local window to use in a (weighted) average. 
In references [3-5], an alpha-trimmed mean filter (α-TMF) was proposed, which uses a "median basket" to select a predetermined number of pixels above and below the median pixel in the sorted pixels of the moving window.

The values in the basket are averaged to give the α-TMF filtering output. An asymmetric way to select the averaging pixels whose values are close to that of the median pixel was presented in references [4-6] and was named the modified trimmed mean filter (MTMF) in reference [4]. Recently, we developed a generalization of the α-TMF, which we denoted as GTMF [8].

It employs the same way to select the pixels from the window for inclusion in the median basket. However, the selected pixels and the center pixel in the window are weighted and averaged to give the filtering output. It has been shown [8] that the GTMF performs better than other well known filters for the removal of either impulse noise or additive noise.

A new nonlinear filtering technique is disclosed, called the "dual window selective averaging filter" (DWSAF), to remove additive noise (e.g., Gaussian noise). Assuming implicitly that the ideal image is piecewise flat, two normal concentric moving windows and a pixel container are employed to determine the values to be used in replacing the gray level value of the center pixel $I_c$. Three steps are employed in this filtering algorithm. First, the GTMF is implemented within the smaller window $W_S$ to give an intermediate output $G_c$ at the center pixel of the window. Second, only the gray level values close to $G_c$ are selected from the larger window $W_L$ and put into the container $C$. Third, if the number of pixels in $C$ equals zero, the final DWSAF output $D_c$ is just $G_c$; otherwise, the gray level values in $C$ are averaged to give the final DWSAF output $D_c$.

In contrast to the α-TMF, the GTMF also includes the center pixel $I_c$ in the averaging operation and its weight is usually larger than those of other pixels in the median basket, which is important for the removal of additive noise.

A threshold $T$ based on the GTMF output $G_c$ is used to select the pixels for the container. A pixel is selected for inclusion in the container if its value is in the range $[G_c - T, G_c + T]$. When $T = 0$, the DWSAF is equivalent to the GTMF within the smaller window, and when $T = 255$ (for an 8 bpp gray scale image), it becomes a simple moving average within the larger window.

GENERALIZED ALPHA-TRIMMED MEAN FILTER 

The implementation of the generalized trimmed mean filter (GTMF) has been described in detail in [17]. The pixels $\{I_1, I_2, \ldots, I_{m-1}, I_m, I_{m+1}, \ldots, I_n\}$ in the local window associated with a pixel $I_c$ have been arranged in ascending (or descending) order, with $I_m$ being the median pixel. The key generalization to median filtering introduced in the alpha-trimmed mean filter (α-TMF) [18] is to design a median basket to combine a group of pixels whose gray level values are close to the median value of the window.

An averaging operation is then utilized to generate an adjusted replacement $A_c$ for $I_c$. For example, a 3-entry median basket α-TMF is implemented according to

$$A_c = \frac{I_{m-1} + I_m + I_{m+1}}{3} \qquad (1)$$

It is evident that a single-entry median basket α-TMF is equivalent to the median filter and an $n$-entry median basket α-TMF is equivalent to the simple moving average filter.
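
A one-dimensional sketch of the median-basket idea is given below. The function name `alpha_tmf`, the edge handling by sample replication, and the test signal are assumptions made for illustration; the source does not specify how window edges are treated.

```python
def alpha_tmf(signal, window=5, basket=3):
    """1-D alpha-trimmed mean: average the `basket` sorted values centred on the median."""
    assert window % 2 == 1 and basket % 2 == 1 and basket <= window
    half, b = window // 2, basket // 2
    n = len(signal)
    out = []
    for c in range(n):
        # Replicate edge samples so every window has `window` entries (an assumption).
        win = sorted(signal[min(max(j, 0), n - 1)]
                     for j in range(c - half, c + half + 1))
        m = window // 2                      # index of the median I_m in the sorted window
        out.append(sum(win[m - b:m + b + 1]) / basket)
    return out

x = [10, 10, 10, 200, 10, 10, 10]            # flat signal with one impulse
print(alpha_tmf(x, window=5, basket=1))      # single-entry basket: the median filter
print(alpha_tmf(x, window=5, basket=3))      # 3-entry basket, Eq. (1)
print(alpha_tmf(x, window=5, basket=5))      # full basket: the moving average
```

With a single-entry or 3-entry basket the impulse is rejected entirely, while the full basket smears it over the neighbouring samples, which illustrates the two limiting cases named in the text.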

In general, the α-TMF outperforms the median filter. However, it is still not optimal when filtering either additive-noise-corrupted images or highly impulse-noise-corrupted images. For example, when removing additive noise, the α-TMF does not treat $I_c$ as a special pixel. As is well known, for an image corrupted by additive noise, $I_c$ has the largest probability of being the closest to the true value among all the pixels in the window. Neglecting the influence of the center pixel is a mistake if one desires to filter additive noise. Another difficulty for the α-TMF, when it is used to remove impulse noise from highly corrupted images, is that the pixels selected for the median basket may also be corrupted by impulse noise. It is therefore unreasonable for all the pixels in the basket to have the same weight.

The GTMF uses a median basket to collect a group of pixels from the sorted pixel values of the window associated with $I_c$, in the same way as the α-TMF.

The values of the selected pixels and $I_c$ are then weighted and averaged to give the GTMF output.

For a 3-entry median basket, the output of the GTMF is given by

$$G_c = \frac{w_1 I_{m-1} + w_2 I_m + w_3 I_{m+1} + w_c I_c}{w_1 + w_2 + w_3 + w_c} \qquad (2)$$

where $G_c$ is the GTMF output and $w_1$, $w_2$, $w_3$, and $w_c$ are the weights. It is interesting to see that when $w_c = 0$ and $w_1 = w_2 = w_3 \neq 0$, the GTMF reduces to the α-TMF. When $w_1 = w_3 = w_c = 0$, it becomes the standard median filter. Since the GTMF takes the center pixel $I_c$ into account, we expect that it should also work better for removing additive noise.
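
Eq. (2) for a single window can be sketched as follows. The helper name `gtmf_pixel` and the sample window are hypothetical; the weight ratio 1:1:1:10 is the one quoted later in the text for the lower-noise image.

```python
def gtmf_pixel(window_values, center_value, weights=(1.0, 1.0, 1.0, 10.0)):
    """3-entry-basket GTMF output G_c (Eq. 2) for one window."""
    w1, w2, w3, wc = weights
    s = sorted(window_values)
    m = len(s) // 2                          # median index (odd window length assumed)
    num = w1 * s[m - 1] + w2 * s[m] + w3 * s[m + 1] + wc * center_value
    return num / (w1 + w2 + w3 + wc)

win = [12, 9, 30, 50, 10]                    # the centre pixel is win[2] = 30
as_alpha_tmf = gtmf_pixel(win, win[2], (1, 1, 1, 0))   # w_c = 0: 3-entry alpha-TMF
as_median = gtmf_pixel(win, win[2], (0, 1, 0, 0))      # only w_2 nonzero: median filter
weighted = gtmf_pixel(win, win[2], (1, 1, 1, 10))      # centre pixel dominates
print(as_alpha_tmf, as_median, weighted)
```

The two special cases recover the filters named in the text, and the heavily weighted centre pixel pulls the output toward the original value, which is the behaviour the GTMF relies on for additive noise.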

DUAL WINDOW SELECTIVE AVERAGING FILTER 

In this section, we will discuss how to implement our dual window selective averaging filter (DWSAF), based on the GTMF. The standard selective averaging algorithm [2] is not optimal because it only computes the average (or weighted average) of the gray level values close to that of the center pixel in a window. However, sometimes it is unreasonable to choose the gray level value of the center pixel as a criterion because the pixel may be highly corrupted. Using the standard selective averaging algorithm to treat such a pixel will result in it remaining highly corrupted.

The key idea of the DWSAF is to find an alternative criterion for the standard selective averaging algorithm to use. In reference [4], a modified trimmed mean filter (MTMF) was proposed to average only those pixels whose gray level values fall within the range $[M_c - q, M_c + q]$, where $M_c$ is the median value ($M_c = I_m$) and $q$ is a preselected threshold.

Here, we propose a new filtering algorithm which employs two different sizes of concentric moving windows $W_S$ and $W_L$, and a pixel container $C$. Three steps are involved in the filtering technique. First, the GTMF is employed in the smaller window $W_S$ according to Eq. (2) to give an intermediate output $G_c$, which is used as the criterion for the standard selective averaging algorithm.

Second, from the larger window $W_L$, only the pixels with gray level values close to $G_c$ are selected and placed in a container $C$. In doing so, a switching threshold $T$ is employed to determine if a pixel in $W_L$ is close enough to $G_c$. We determine if a pixel $I_k$ in $W_L$ belongs to $C$ as follows:

$$I_k \in C, \quad \text{if } |I_k - G_c| < T \qquad (3)$$

where $T$ is the threshold.

Third, if the number of pixels $N$ in the container $C$ is greater than zero, the gray level values of the pixels in the container are averaged with the same weight for all pixels to give the final output of the DWSAF. If $N = 0$, the final output equals $G_c$. We summarize this as

$$D_c = \begin{cases} A_c, & \text{if } N \neq 0 \\ G_c, & \text{if } N = 0 \end{cases} \qquad (4)$$

where $D_c$ is the output of the DWSAF to replace $I_c$ and $A_c$ is the average of the gray level values in the container. The reason for employing two windows is as follows. As in the case of the median filter, the implementation of the GTMF using a larger window blurs the image more than using a smaller window. However, the output of the GTMF in a smaller window can be used as a criterion to improve the filtering result by use of the standard selective averaging algorithm in the larger window.

This reduces the image blurring because we only average those gray level values that are close to the GTMF output. For different choices of the threshold $T$, the DWSAF can be made to be either a GTMF with respect to the smaller window $W_S$ ($T = 0$) or a simple moving average within the larger window $W_L$ ($T = 255$ for an 8 bpp gray-scale image).
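
The three steps can be put together in a one-dimensional sketch. This is an illustration under stated assumptions, not the authors' implementation: the function names, the edge replication, the small window sizes (3 and 5), the threshold T = 2.0 and the test signal are all chosen here for demonstration, while the weights 0:3:0:4 are those quoted for the one-dimensional experiment. The container test uses the strict inequality of Eq. (3).

```python
def gtmf(win, center, weights):
    """3-entry-basket GTMF output (Eq. 2) for one window."""
    w1, w2, w3, wc = weights
    s = sorted(win)
    m = len(s) // 2
    return (w1 * s[m - 1] + w2 * s[m] + w3 * s[m + 1] + wc * center) / (w1 + w2 + w3 + wc)

def window(signal, c, size):
    """Odd-length window centred at c, with edge samples replicated (an assumption)."""
    n, half = len(signal), size // 2
    return [signal[min(max(j, 0), n - 1)] for j in range(c - half, c + half + 1)]

def dwsaf(signal, T, small=3, large=5, weights=(0.0, 3.0, 0.0, 4.0)):
    """One DWSAF pass over a 1-D signal."""
    out = []
    for c in range(len(signal)):
        g = gtmf(window(signal, c, small), signal[c], weights)           # step 1: G_c in W_S
        container = [v for v in window(signal, c, large) if abs(v - g) < T]  # step 2: Eq. (3)
        out.append(sum(container) / len(container) if container else g)      # step 3: Eq. (4)
    return out

# Noisy step edge between levels 0 and 10 (hypothetical data).
x = [0.1, -0.2, 0.0, 0.2, -0.1, 10.1, 9.8, 10.0, 10.2, 9.9]
y = dwsaf(x, T=2.0)
print([round(v, 3) for v in y])
```

Samples on either side of the step are averaged only with neighbours from the same side, so the edge is kept while the noise within each flat region is reduced. Setting T = 0 leaves the container empty, and the pass then returns the GTMF output of the smaller window.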

The DWSAF algorithm can be employed iteratively, which generally improves its performance. If the weight $w_c$ of the center pixel is too high, the output values at some pixels may remain close to their original input values after a number of iterations, and thus some isolated impulse-like noise may be present. Numerical experimentation shows that changing the threshold $T$ to zero after a number of iterations greatly alleviates this problem.

NUMERICAL EXPERIMENTS 

Two numerical examples are presented here for testing our filtering algorithm. 
The first one is a standard one-dimensional blocked signal, which is shown in Figure 1(a). We degrade it with Gaussian noise of mean square error MSE = 1.00 and mean absolute error MAE = 0.80 (see Figure 1(b)). The sizes of the two moving windows we used are 7 and 19 for $W_S$ and $W_L$ respectively. The weights for the 3-entry basket which we used for the GTMF are $w_1 = w_3 = 0$, $w_2 = 3$ and $w_c = 4$. The switching threshold
is chosen to be 7M.5. To obtain improved filtering results, the DWSAF algorithm is 
implemented recursively. 

In Figure 1(c), we show the filtered signal, which is in excellent agreement with the original noise-free signal. The MSE and MAE of our filtered result are 6.19E-2 and 6.49E-3 respectively. The DWSAF is extremely efficient and robust for removing additive noise from blocked signals.

To further confirm the usefulness of the algorithm, we next consider a two-dimensional benchmark 8 bit, 512 × 512, "Lena" image, which has been degraded with two different amounts of Gaussian noise. Figure 2(a) shows the corrupted Lena image with peak signal-to-noise ratio PSNR = 22.17 dB and Figure 3(a) shows the corrupted image with PSNR = 18.82 dB. Since the original noise-free image is not exactly piecewise flat, we cannot expect a filtering performance as efficient as in the first example. However, we expect that the DWSAF will yield improved results.

The sizes of the two windows for both corrupted images are 3 for $W_S$ and 5 for $W_L$. A three-entry median basket is used for both corrupted images, and the weights are $w_1:w_2:w_3:w_c = 1:1:1:10$ for the image shown in Figure 2(a) and $w_1:w_2:w_3:w_c = 1:1:1:50$ for the image shown in Figure 3(a). The initial switching threshold $T$ is 50 for both corrupted images, and is changed to 0 after the first iteration for the lower noise image and after the second iteration for the higher noise image. Figures 2(b) and 3(b) show the filtered images produced by our numerical algorithm. Comparing them with the degraded images, it is clear that the algorithm can effectively remove noise and significantly preserve edges simultaneously. For a quantitative evaluation of the performance of the algorithm, the PSNR, mean square error (MSE) and mean absolute error (MAE) comparison for different filtering algorithms are listed in TABLE I.

TABLE I.

Comparative Filtering Results for Lena Image
Corrupted with Different Amounts of Gaussian Noise

                    Noisy Image I                   Noisy Image II
Algorithm*          PSNR        MSE       MAE       PSNR        MSE       MAE
No Denoising        22.17 dB    394.37    15.97     18.82 dB    853.07    23.71
Median (3 x 3)      29.38 dB    75.09     6.26      27.60 dB    112.97    7.72
Median (5 x 5)      28.60 dB    89.66     6.53      27.39 dB    118.41    7.53
α-TMF (3 x 3)       29.84 dB    67.46     6.07      28.13 dB    100.10    7.04
MTMF (3 x 3)        29.91 dB    66.46     5.79      28.23 dB    97.74     7.39
New Approach        30.69 dB    55.50     5.34      28.96 dB    82.69     6.45

*All results are implemented recursively for optimal PSNR performance. The threshold used for the MTMF algorithm is optimized to q=65 for image I and q=100 for image II.



25 



The PSNR is increased by about 8.52 dB for the lower-noise image and 10.14 dB
for the higher-noise image, both of which are better than the best median,
α-TMF and MTMF filtering results.
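
The three figures of merit reported in TABLE I can be computed as in the following minimal sketch. It assumes 8-bit images (peak value 255) held in NumPy arrays; the function names are illustrative and do not come from the original work.

```python
import numpy as np


def mse(reference, test):
    """Mean square error between two images of equal shape."""
    diff = np.asarray(reference, dtype=np.float64) - np.asarray(test, dtype=np.float64)
    return float(np.mean(diff ** 2))


def mae(reference, test):
    """Mean absolute error between two images of equal shape."""
    diff = np.asarray(reference, dtype=np.float64) - np.asarray(test, dtype=np.float64)
    return float(np.mean(np.abs(diff)))


def psnr_from_mse(mse_value, peak=255.0):
    """Peak signal-to-noise ratio in dB; peak=255 assumes an 8-bit image."""
    return float(10.0 * np.log10(peak ** 2 / mse_value))


def psnr(reference, test, peak=255.0):
    """PSNR in dB computed directly from two images."""
    return psnr_from_mse(mse(reference, test), peak)
```

Under the 8-bit assumption, the MSE and PSNR columns of TABLE I are mutually consistent: an MSE of 394.37 corresponds to about 22.17 dB, and an MSE of 55.50 to about 30.69 dB.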



CONCLUSIONS

In this paper, a new filtering algorithm for removing additive noise from a
corrupted image is presented. In comparison to standard filtering schemes, the
DWSAF algorithm employs two concentric moving windows to determine the gray
level value used to replace that of the center pixel. Employing the larger window
does not lead to significant blurring of the image because of the effect of the smaller
window. The GTMF is used in the smaller window to obtain the criterion for
standard selective averaging in the larger window. Numerical examples show that the
proposed filtering algorithm is both efficient and robust for recovering blocked
signals contaminated with additive noise.

ARBITRARY DIMENSION EXTENSION OF SYMMETRIC DAF
INTRODUCTION

The distributed approximating functional (DAF) approach to the time
evolution of wave packets is a powerful new method aimed at taking advantage of the
local nature of the potential in the coordinate representation for ordinary chemical
collisions, and the localized nature of the kinetic energy in this same representation.
These features of the Hamiltonian lead to a highly banded expression for the
propagator, specific to a restricted class of wave packets (DAFs). With control of the
parameters which determine it, the DAF class can be made to include the wave
packets of interest in any particular time-dependent quantum problem. Because the
propagator is banded, its application to propagation on grids (1) scales like the
number of grid points, N, in any dimension (which is the ultimate for any grid
method; the scaling constant depends on the band width); (2) requires reduced
communication time when implemented on massively parallel computers; and (3)
minimizes the storage requirement for the propagator.

In general, the DAF can be regarded as a radial interpolating basis, as well as a
scaling function. The corresponding DAF wavelets can be implemented using
generalized DAFs. As a kind of radial basis functional (RBF), DAF neural networks
are constructed and applied in signal processing.

We present the derivation of the DAFs independent of the choice of
coordinates and show how the Hermite functions become generalized for arbitrary
numbers of degrees of freedom and choices of orthogonal coordinates. We discuss the
properties of the generalized "DAF polynomials" associated with different coordinates
and numbers of degrees of freedom, and how angular momentum conservation leads
to a radial propagator for spherical waves. Symmetric quincunx-DAF Dirichlet
classes are carefully designed for hyperspherical surface representation.

DAF THEORY 

The theory of the fundamental distributed approximating functional is based on
the Hermite function representation of the 1D δ-function, which is

$$\delta(x|\sigma) = \frac{1}{\sigma} \exp\left(-\frac{x^2}{2\sigma^2}\right) \sum_{n=0}^{\infty} \left(-\frac{1}{4}\right)^n \frac{1}{\sqrt{2\pi}\, n!} H_{2n}\left(\frac{x}{\sqrt{2}\,\sigma}\right)$$    (1)

The function $H_{2n}$ is the Hermite polynomial of even order, 2n. The quantity σ
is the width of the Gaussian window of the Hermite polynomial. The qualitative
behavior of one particular Hermite DAF is shown in Figure 1. The Hermite
polynomial $H_n$ is generated by the usual recursion

$$H_n(x) = \begin{cases} 1, & n = 0 \\ 2x, & n = 1 \\ 2x H_{n-1}(x) - 2(n-1) H_{n-2}(x), & n > 1 \end{cases}$$    (2)

The predominant advantage of the Hermite polynomial approximation is its
high-order derivative preservation (which leads to a smooth approximation). This
paper expands and tests the approach of Madan and Milne (1994) for pricing
contingent claims as elements of a separable Hilbert space. We specialize the Hilbert
space basis to the family of Hermite polynomials and use the model to price options
on Eurodollar futures. Restrictions on the prices of Hermite polynomial risk for
contingent claims with different times to maturity are derived. These restrictions are
rejected by our empirical tests of a four-parameter model. The unrestricted results
indicate skewness and excess kurtosis in the implied risk-neutral density. These
characteristics of the density are also mirrored in the statistical density estimated from
a time series on LIBOR. The out-of-sample performance of the four-parameter model
is consistently better than that of a two-parameter version of the model.

The Hermite polynomials are given in terms of their generator by

$$H_n(x) \exp(-x^2) = (-1)^n \frac{d^n}{dx^n} \exp(-x^2) = (-1)^n \nabla_x^n \exp(-x^2)$$    (3)

where $\nabla_x$ is the x component of the gradient operator. This equation provides two
equivalent and useful ways in which the above function can be expressed:

$$\delta(x - x'|\sigma) = \frac{1}{\sigma\sqrt{2\pi}} \sum_{n=0}^{\infty} \left(-\frac{1}{4}\right)^n \frac{1}{n!} (2\sigma^2)^n \nabla_x^{2n} \exp[-(x - x')^2 / 2\sigma^2]$$    (4)

or

$$\delta(x - x'|\sigma) = \frac{1}{\sigma\sqrt{2\pi}} \sum_{n=0}^{\infty} \left(-\frac{1}{4}\right)^n \frac{1}{n!} (2\sigma^2)^n \nabla_{x'}^{2n} \exp[-(x - x')^2 / 2\sigma^2]$$    (5)

where in the second equation $\nabla_x^{2n}$ has been replaced by $\nabla_{x'}^{2n}$. The two are
equivalent because the derivative acts on an even function of (x - x').

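The simplest practical implementation truncates the sum at some maximum
value M/2 to obtain

$$\delta_M(x - x'|\sigma) = \frac{1}{\sigma\sqrt{2\pi}} \sum_{n=0}^{M/2} \left(-\frac{1}{4}\right)^n \frac{1}{n!} (2\sigma^2)^n \nabla_x^{2n} \exp[-(x - x')^2 / 2\sigma^2]$$    (6)

As indicated by the notation, the distributed approximating functional $\delta_M(x|\sigma)$ depends
upon M and σ, but it does not depend upon the coordinate system per se. Here it is
easy to show that

$$\lim_{\sigma \to 0} \delta_M(x - x'|\sigma) = \delta(x - x')$$    (7)

for any fixed M. The availability of two independent parameters, either of which can
be used to generate the identity kernel or Dirac delta function, can be viewed as the
source of robustness of the DAFs as computational tools. See, e.g., D. K. Hoffman and D.
J. Kouri, "Distributed approximating functional theory for an arbitrary number of
particles in a coordinate system-independent formalism," J. Physical Chemistry,
Vol. 97, No. 19, pp. 4984-4988, 1993. A simple expression of the Hermite DAF is

$$\delta_M(x|\sigma) = \frac{1}{\sigma} \exp\left(-\frac{x^2}{2\sigma^2}\right) \sum_{n=0}^{M/2} \left(-\frac{1}{4}\right)^n \frac{1}{\sqrt{2\pi}\, n!} H_{2n}\left(\frac{x}{\sqrt{2}\,\sigma}\right)$$    (8)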
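
As a hedged illustration of the truncated Hermite DAF, the sketch below evaluates the Hermite form, a Gaussian of width σ multiplying a sum of even Hermite polynomials $H_{2n}(x/\sqrt{2}\sigma)$ with weights $(-1/4)^n / (\sqrt{2\pi}\, n!)$, n = 0, ..., M/2, and uses it as a discrete smoothing kernel on a uniform grid. The function names and the simple Riemann-sum quadrature are assumptions made for illustration, not part of the original formalism.

```python
import math

import numpy as np
from numpy.polynomial.hermite import hermval  # physicists' Hermite polynomials H_n


def hermite_daf(x, sigma, M):
    """Truncated Hermite DAF delta_M(x | sigma); M is a non-negative even integer."""
    if M < 0 or M % 2:
        raise ValueError("M must be a non-negative even integer")
    # Only even-order Hermite polynomials contribute: coefficient of H_{2n}.
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    y = np.asarray(x, dtype=np.float64) / (math.sqrt(2.0) * sigma)
    # exp(-y**2) equals exp(-x**2 / (2 sigma**2)).
    return np.exp(-y ** 2) * hermval(y, coeffs) / sigma


def daf_approximate(f_samples, grid, x, sigma, M):
    """Approximate f(x) by sum_j delta_M(x - x_j | sigma) f(x_j) dx on a uniform grid."""
    dx = grid[1] - grid[0]
    return float(np.sum(hermite_daf(x - grid, sigma, M) * f_samples) * dx)
```

With M = 0 the kernel reduces to a plain Gaussian, which smooths $x^2$ into $x^2 + \sigma^2$; with M = 2 the Hermite correction removes that bias and the quadratic is reproduced, which is one way to see why larger M gives a better approximation to the identity at fixed σ.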



ARBITRARY DAF EXTENSION IN MULTI-DIMENSION SPACE 
Tensor-product extension 

Different selections of interpolating shells result in different DAFs.
Theoretically, this kind of functional can be regarded as the smoothing operator or the
scaling function in wavelet theory. It can be used to generate the corresponding
wavelets (differential functionals) for signal analysis. The discrete wavelet transform
is implemented using filter banks. Hoffman and Kouri gave a tensor-product extension
of the DAF in multi-dimensional Hilbert space, which can be explained in detail as follows.
Consider a Dirichlet system which can be represented in a general configuration space
by a vector R with an arbitrary number N of Cartesian components, $x_1, x_2, \ldots, x_N$. By
"Cartesian" we mean that the "tensor" volume element for the space is given by

$$dX = \prod_{j=1}^{N} dx_j$$    (9)

Thus, an arbitrary function, Ψ(X), in Hilbert space can be represented by

$$\Psi(X) = \int \delta(X - X')\, \Psi(X')\, dX'$$    (10)

where δ(X - X') is the δ-function in N-dimensional space, which can be written in the
form

$$\delta(X - X') = \prod_{j=1}^{N} \delta(x_j - x_j')$$    (11)



Using the tensor product to substitute for the 1D scalar variable, we obtain the
expression in N-dimensional Hilbert space as

$$\delta(X - X'|\sigma) = (\sigma\sqrt{2\pi})^{-N} \sum_{n=0}^{\infty} \left(-\frac{1}{4}\right)^n \frac{1}{n!} (2\sigma^2)^n \nabla_X^{2n} \exp[-(X - X')^2 / 2\sigma^2]$$    (12)

where the single sum over n collects the multi-variable sum

$$\sum_{n=0}^{\infty} \; \sum_{n_1 + \cdots + n_N = n} = \sum_{n_1=0}^{\infty} \sum_{n_2=0}^{\infty} \cdots \sum_{n_N=0}^{\infty}$$    (13)

with the multinomial weight

$$\frac{n!}{n_1!\, n_2! \cdots n_N!}$$    (14)

In multi-dimensional Hilbert space

$$n = \sum_{j=1}^{N} n_j$$    (15)

The gradient operation is

$$\nabla_X^{2n} = (\nabla_X^2)^n = \sum_{n_1 + \cdots + n_N = n} \frac{n!}{n_1! \cdots n_N!} \prod_{j=1}^{N} \frac{\partial^{2n_j}}{\partial x_j^{2n_j}}$$    (16)

where $\nabla_X^2$ is the N-dimensional Laplacian operator, and

$$(X - X')^2 = \sum_{j=1}^{N} (x_j - x_j')^2$$    (17)

The advantage of this expression is that it is independent of the particular
choice of coordinate system, and we can use whatever coordinates are convenient for a
particular problem (e.g., Cartesian, cylindrical polar, spherical polar, hyperspherical,
elliptical, etc., as the case might be). A 2D tensor-based Hermite DAF is shown in
Figure 2. The generalized interpolating matrix is hyper-square and can be separated by
different directional vectors.



Tensor-product extension 

DAFs define a class of symmetric interpolants. Besides the tensor-product
extension, the extension can take more interesting shapes. To preserve the symmetry
property, any DAF or DAF-like interpolating functional is expressed as

$$\delta(x) = \delta(|x|)$$    (18)

Therefore the fundamental variable for DAF processing is |x|. The extended
norm-preserved quincunx solution is implemented by the following substitution in
N-D space:

$$|X| = \left( \sum_{j=1}^{N} |x_j|^p \right)^{1/q}$$    (19)

When p = q, the N-D space multi-variable becomes the $l_p$ norm

$$\|X\|_p = \left( \sum_{j=1}^{N} |x_j|^p \right)^{1/p}$$    (20)

When p equals 1/2, the contour of the 2D quincunx DAF is a pseudo-diamond shape.
The practical truncated shape is represented by the following formula:

(21)

(22)

$$w(x, y) = e^{-(|x| + |y|)^2 / (2\sigma^2)}$$    (23)

(24) 



w<x,jj;) = e-^> ;r ^ 2 ) 



CONCLUSION

A new algorithm for impulse noise removal was presented. A group of
significant values in the neighboring window of one pixel are bundled and weighted
to obtain a modified luminance estimation (MLE). A threshold selective-pass
technique is employed to determine whether any given pixel should be replaced by its
MLE. Iterative processing improves the performance of our algorithm for highly
corrupted images. Numerical experiments show that our technique is extremely
robust and efficient to implement, and leads to significant improvement over other
well-known methods.

LAGRANGE WAVELETS FOR SIGNAL PROCESSING
INTRODUCTION

The theory of interpolating wavelets based on a subdivision scheme has
attracted much attention recently. It possesses the attractive characteristic that the
wavelet coefficients are obtained from linear combinations of discrete samples rather
than from traditional inner product integrals. Mathematically, various interpolating
wavelets can be formulated in a biorthogonal setting. Harten has described a kind of
piecewise biorthogonal wavelet construction method [12]. Sweldens independently has
developed essentially this method into the well known "lifting scheme" theory [32],
which can be regarded as a special case of the Neville filters [19]. Unlike the previous
method for constructing biorthogonal wavelets, which relies on explicit solution of
coupled algebraic equations [10], the lifting scheme enables one to construct
custom-designed biorthogonal wavelet transforms assuming only a single low-pass
filter, without iterations.

Generally speaking, the lifting-interpolating wavelet theory is closely related
to the finite element technique for the numerical solution of partial differential
equations, the subdivision scheme for interpolation and approximation, multi-grid
generation and surface fitting techniques. The most attractive feature of the approach
is that the discrete samplings made are identical to a wavelet multiresolution
analysis. Without any of the pre-conditioning or post-conditioning processes required
for accurate wavelet analysis, the interpolating wavelet coefficients can be
implemented using a parallel computational scheme.

Lagrange interpolation polynomials are commonly used for signal 
approximation and smoothing, etc. By carefully designing the interpolating Lagrange 
functionals, one can obtain smooth interpolating scaling functions with arbitrary order 
of regularity. In this paper, we will present three different kinds of biorthogonal 
interpolating Lagrange wavelets (Halfband Lagrange wavelets, B-spline Lagrange 
wavelets and Gaussian-Lagrange DAF wavelets) as specific examples of generalized 
interpolating Lagrange wavelets. 

Halfband Lagrange wavelets can be regarded as an extension of Dubuc
interpolating functionals [8, 11], auto-correlation shell wavelet analysis [26], and
halfband filters [1]. B-spline Lagrange wavelets are generated by a
B-spline-windowed Lagrange functional, which increases the smoothness and localization
properties of the simple Lagrange scaling function and related wavelets.

Lagrange Distributed Approximating Functionals (LDAFs), i.e., Gaussian-modulated
Lagrange polynomials, have been successfully applied for numerically
solving various linear and nonlinear partial differential equations [40]. Typical
examples include DAF simulations of 3-dimensional reactive quantum scattering and
2-dimensional Navier-Stokes fluid flow with non-periodic boundary conditions. In
terms of wavelet analysis, DAFs can be regarded as particular scaling functions
(wavelet-DAFs); the associated DAF-wavelets can be generated in a number of ways
[41]. Both DAFs and DAF-wavelets are smooth and decay rapidly in both the time
and frequency representations. One objective of the present work is to extend the
DAF approach to signal and image processing by constructing new biorthogonal
DAF-wavelets and their associated DAF-filters using the lifting scheme [32].

As an example application of Lagrange wavelets, we consider image
processing, de-noising and restoration. This application requires dealing with huge
data sets, complicated space-frequency distributions and complex perception-dependent
characteristics. De-noising and restoration play an important role in image
processing. Noise distortion not only affects the visual quality of images, but also
degrades the efficiency of data compression and coding.

To make explicit the time-frequency characteristics of wavelets, an earlier group
normalization (GN) technique [28, 29] has been utilized to re-scale the magnitudes of
various subband filters and obtain normalized equivalent decomposition filters
(EDFs). The group normalization process corrects the drawback that the magnitudes
of the transform coefficients do not correctly reflect the true strength of the various
signal components.

Moreover, in order to achieve the best noise-removing efficiency, the human
vision response is best accounted for by a perceptual normalization (PN) based on the
response property of the Human Vision System (HVS). The concept of visual loss-less
quantization, introduced by Watson [39], is utilized to construct the visual loss-less
matrix, which modifies the magnitudes of the normalized wavelet coefficients.

Perceptual signal processing has the potential of overcoming the limits of the
traditional Shannon Rate-Distortion (R-D) theory for perception-dependent
information, such as images and acoustic signals. Previously, Ramchandran, Vetterli,
Xiong, Herley, Asai, and Orchard have utilized a rate-distortion compromise for
image compression [14, 23, 24, and 42]. Our recently derived Visual Group
Normalization (VGN) technique [31] can likely be used with a rate-distortion
compromise to generate a so-called Visual Rate-Distortion (VR-D) theory to further
improve image processing.

Softer Logic Masking (SLM) is an adjusted de-noising technique [29],
designed to improve the filtering performance of Donoho's Soft Threshold (ST)
method [9]. The SLM technique efficiently preserves important information,
particularly at an edge transition, in a manner particularly suited to human visual
perception.



INTERPOLATING WAVELETS 

The basic characteristics of interpolating wavelets of order D discussed in
reference [10] require that the primary scaling function, $\phi$, satisfies the following
conditions.

(1) Interpolation:

$$\phi(k) = \begin{cases} 1, & k = 0 \\ 0, & k \neq 0 \end{cases}, \quad k \in Z$$    (1)

(2) Self-induced Two-Scale Relation: $\phi$ can be represented as a linear
combination of dilates and translates of itself, where the weight is the value of $\phi$ at a
subdivision integer of order 2.

$$\phi(x) = \sum_k \phi(k/2)\, \phi(2x - k)$$    (2)

This is only approximately satisfied for some of the interpolating wavelets discussed
in the later sections; however, the approximation can be made arbitrarily accurate.

(3) Polynomial Span: For an integer $D \ge 0$, the collection of formal sums,
symbolized by $\sum_k C_k \phi(x - k)$, contains all polynomials of degree D.

(4) Regularity: For real V > 0, $\phi$ is Hölder continuous of order V.

(5) Localization: $\phi$ and all its derivatives through order [V] decay rapidly:

$$|\phi^{(r)}(x)| \le A_s (1 + |x|)^{-s}, \quad x \in R, \; s > 0, \; 0 \le r \le [V]$$    (3)

where [V] represents the maximum integer which does not exceed V.
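
The interpolation condition and the self-induced two-scale relation can be checked numerically for a concrete scaling function. The sketch below uses the piecewise-linear "hat" function, chosen here only as the simplest interpolating example; it is an illustrative assumption, not one of the wavelets constructed in this work.

```python
import numpy as np


def hat(x):
    """Piecewise-linear interpolating scaling function: 1 - |x| on [-1, 1], else 0."""
    return np.maximum(0.0, 1.0 - np.abs(np.asarray(x, dtype=np.float64)))


def two_scale_rhs(phi, x, k_max=4):
    """Right-hand side of the two-scale relation: sum_k phi(k/2) * phi(2x - k)."""
    x = np.asarray(x, dtype=np.float64)
    total = np.zeros_like(x)
    for k in range(-k_max, k_max + 1):
        total += phi(k / 2.0) * phi(2.0 * x - k)
    return total


def satisfies_interpolation(phi, k_max=4):
    """Check phi(0) = 1 and phi(k) = 0 for the other integers |k| <= k_max."""
    ks = np.arange(-k_max, k_max + 1)
    expected = (ks == 0).astype(np.float64)
    return bool(np.allclose(phi(ks), expected))
```

For the hat function only the weights $\phi(0) = 1$ and $\phi(\pm 1/2) = 1/2$ are non-zero, so the two-scale sum has three terms and reproduces $\phi$ exactly.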

Interpolating wavelets are particularly efficient for signal representation since
their multiresolution analysis can be simply realized by discrete sampling. This makes
it easy to generate a subband decomposition of the signal without requiring tedious
iterations. Moreover, adaptive boundary treatments and non-uniform samplings can be
easily implemented using interpolating methods.

Compared with commonly used wavelet transforms, the interpolating wavelet
transform possesses the following characteristics:

1. The wavelet transform coefficients are generated by linear combination of
signal samplings, instead of the commonly used convolution of the wavelet transform,
such as

$$W_{j,k} = \int_R \psi_{j,k}(x)\, f(x)\, dx$$    (4)

where $\psi_{j,k}(x) = 2^{j/2} \psi(2^j x - k)$.

2. A parallel-computing algorithm can be easily constructed. The calculation and
compression of coefficients are not coupled. For the halfband filter with length N, the
calculation of each wavelet coefficient, $W_{j,k}$, does not exceed N + 2 multiply/adds.

3. For a D-th order differentiable function, the wavelet coefficients decay rapidly.

4. In a mini-max sense, threshold masking and quantization are nearly optimal
for a wide variety of regularization algorithms.

Theoretically, interpolating wavelets are closely related to the following
wavelet types:

Band-limited Shannon wavelets

The π-band-limited function, $\phi(x) = \sin(\pi x)/(\pi x) \in C^\infty$ in Paley-Wiener space, constructs
interpolating functions. Every π-band-limited function $f \in L^2(R)$ can be reconstructed by
the equation

$$f(x) = \sum_k f(k)\, \frac{\sin \pi(x - k)}{\pi(x - k)}$$    (5)

where the related wavelet function (the Sinclet) is defined as (see Figure 1)

$$\psi(x) = \frac{\sin \pi(2x - 1) - \sin \pi(x - 1/2)}{\pi(x - 1/2)}$$    (6)
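
The Shannon reconstruction formula can be tried out directly, as in the minimal sketch below. It assumes integer sampling points and truncates the infinite sum to the samples supplied; the function name and the test signal are illustrative choices.

```python
import numpy as np


def sinc_reconstruct(samples, sample_indices, x):
    """Evaluate sum_k f(k) * sin(pi (x - k)) / (pi (x - k)) at the points x.

    samples[i] is f(sample_indices[i]); the sum is truncated to the given samples.
    """
    x = np.atleast_1d(np.asarray(x, dtype=np.float64))
    k = np.asarray(sample_indices, dtype=np.float64)
    # np.sinc(t) is sin(pi t) / (pi t), i.e. the pi-band-limited interpolating function.
    kernel = np.sinc(x[:, None] - k[None, :])
    return kernel @ np.asarray(samples, dtype=np.float64)


def example_signal(t):
    """A pi-band-limited test signal whose samples decay like 1 / t**2."""
    return np.sinc(np.asarray(t, dtype=np.float64) / 2.0) ** 2
```

Because the kernel equals one at its own sample point and zero at every other integer, the reconstruction is exact on the sampling grid; between grid points the accuracy depends on how quickly the discarded samples decay.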

Interpolating fundamental spline 

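The fundamental polynomial spline of degree D, $\eta^D(x)$, where D is an odd
integer, has been shown by Schoenberg (1972) to be an interpolating wavelet (see
Figure 2). It is smooth with order R = D - 1, and its derivatives through order D - 1 decay
exponentially [34]. Thus

$$\eta^D(x) = \sum_k \alpha^D(k)\, \beta^D(x - k)$$    (7)

where $\beta^D(x)$ is the B-spline of order D defined as

$$\beta^D(x) = \sum_{j=0}^{D+1} \frac{(-1)^j}{D!} \binom{D+1}{j} \left(x + \frac{D+1}{2} - j\right)^D U\left(x + \frac{D+1}{2} - j\right)$$    (8)

Here U is the step function

$$U(x) = \begin{cases} 0, & x < 0 \\ 1, & x \ge 0 \end{cases}$$    (9)

and $\{\alpha^D(k)\}$ is the sequence that satisfies the infinite summation condition

$$\sum_k \alpha^D(k)\, \beta^D(n - k) = \delta(n)$$    (10)
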
Deslauriers-Dubuc functional 

Let D be an odd integer, with D > 0. There exist functions, $F_D$, such that if $F_D$
has already been defined at all binary rationals with denominator $2^j$, it can be extended
by polynomial interpolation to all binary rationals with denominator $2^{j+1}$, i.e., all
points halfway between previously defined points [8, 11]. Specifically, to define the
function at $(k + 1/2)/2^j$ when it is already defined at all $\{k 2^{-j}\}$, fit a polynomial $\pi_{j,k}$ to
the data $(k'/2^j, F_D(k'/2^j))$ for $k' \in \{k - (D-1)/2, \ldots, k + (D+1)/2\}$. This polynomial is
unique, and we set

$$F_D\left(\frac{k + 1/2}{2^j}\right) \equiv \pi_{j,k}\left(\frac{k + 1/2}{2^j}\right)$$    (11)

This subdivision scheme defines a function that is uniformly continuous at the
rationals and has a unique continuous extension. The function $F_D$ is a compactly
supported interval polynomial and is regular; it is the auto-correlation function of the
Daubechies wavelet of order D + 1. This function is at least as smooth as the
corresponding Daubechies wavelets.
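
One step of the Deslauriers-Dubuc subdivision can be sketched as follows. The code fits the degree-D Lagrange polynomial through the D + 1 nearest known points and evaluates it at the midpoint; periodic boundary handling and the function names are assumptions made to keep the example short.

```python
import numpy as np


def lagrange_midpoint_weights(D):
    """Weights that evaluate the degree-D Lagrange interpolant halfway between nodes.

    Nodes are the D + 1 integers -(D - 1)/2, ..., (D + 1)/2 and the evaluation
    point is 1/2, i.e. halfway between node 0 and node 1.
    """
    if D < 1 or D % 2 == 0:
        raise ValueError("D must be a positive odd integer")
    nodes = np.arange(-(D - 1) // 2, (D + 1) // 2 + 1, dtype=np.float64)
    weights = np.ones_like(nodes)
    for i, xi in enumerate(nodes):
        for j, xj in enumerate(nodes):
            if i != j:
                weights[i] *= (0.5 - xj) / (xi - xj)
    return weights


def dubuc_refine(values, D=3):
    """One subdivision step on periodic data: keep old points, insert midpoints."""
    values = np.asarray(values, dtype=np.float64)
    weights = lagrange_midpoint_weights(D)
    offsets = range(-(D - 1) // 2, (D + 1) // 2 + 1)
    # np.roll(values, -o)[k] is values[k + o] with periodic wrap-around.
    midpoints = sum(w * np.roll(values, -o) for w, o in zip(weights, offsets))
    refined = np.empty(2 * values.size)
    refined[0::2] = values
    refined[1::2] = midpoints
    return refined
```

For D = 3 the midpoint weights are (-1, 9, 9, -1)/16, the familiar four-point scheme; repeated calls to `dubuc_refine` approach the limit function on the dyadic rationals.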

Auto-correlation shell of orthonormal wavelets 

If $\phi$ is an orthonormal scaling function, its auto-correlation
$\Phi(x) = \int \phi(t)\, \phi(t - x)\, dt$ is an interpolating wavelet (Figure 3) [26]. Its smoothness,
localization and two-scale relations derive from $\phi$. The auto-correlations of the Haar,
Lemarié-Battle, Meyer, and Daubechies wavelets lead, respectively, to the
interpolating Schauder, interpolating spline, $C^\infty$ interpolating, and Deslauriers-Dubuc
wavelets.

Lagrange half-band filters 
Ansari, Guillemot, and Kaiser [1] have used Lagrange symmetric halfband
FIR filters to design the orthonormal wavelets that express the relation between the
Lagrange interpolators and Daubechies wavelets [6]. Their filter corresponds to the
Deslauriers-Dubuc wavelet of order D = 7 (2M - 1), M = 4. The transfer function of the
halfband symmetric filter h is given by

$$H(z) = \frac{1}{2} + z\, T(z^2)$$    (12)

where T is a trigonometric polynomial. Except for h(0) = 1/2, at every even integer
lattice point h(2n) = 0, $n \neq 0$, $n \in Z$. The transfer function of the symmetric FIR filter,
h(n) = h(-n), has the form

$$H(z) = \frac{1}{2} + \sum_{n=1}^{M} h(2n - 1)\left(z^{1-2n} + z^{2n-1}\right)$$    (13)

The concept of an interpolating wavelet decomposition is similar to that of the
"algorithme à trous," the connection having been found by Shensa [27]. The
self-induced scaling condition and the interpolation condition are the most important
characteristics of interpolating wavelets. According to the following equation

$$f(x) = \sum_n f(n)\, \phi(x - n)$$    (14)

the signal approximation is exact on the discrete sampling points, which does not hold
in general for commonly used non-interpolating wavelets.

LAGRANGE WAVELETS 

Halfband Lagrange Wavelets 

A halfband filter is one whose even samples of the impulse response
are constrained such that h(0) = 1/2 and h(2n) = 0 for n = ±1, ±2, ... A special case of
symmetric halfband filters can be obtained by choosing the filter coefficients
according to the Lagrange interpolation formula. The filter coefficients are then given
by

$$h(2n - 1) = \frac{(-1)^{n+M-1} \prod_{m=1}^{2M} (M + 1/2 - m)}{(M - n)!\,(M + n - 1)!\,(2n - 1)}$$    (15)

These filters have the property of maximal flatness. They possess a balance between
the degree of flatness at zero frequency and flatness at the Nyquist frequency (half
sampling).
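
The Lagrange halfband coefficients can be generated with a few lines of code. The sketch below assumes the closed form $h(2n-1) = (-1)^{n+M-1} \prod_{m=1}^{2M}(M + 1/2 - m) / [(M-n)!\,(M+n-1)!\,(2n-1)]$ for n = 1, ..., M, together with h(0) = 1/2 and vanishing even taps; the function names are illustrative.

```python
import math


def lagrange_halfband_odd_taps(M):
    """Return [h(1), h(3), ..., h(2M-1)] of the Lagrange halfband filter."""
    if M < 1:
        raise ValueError("M must be a positive integer")
    product = 1.0
    for m in range(1, 2 * M + 1):
        product *= (M + 0.5 - m)
    taps = []
    for n in range(1, M + 1):
        sign = (-1.0) ** (n + M - 1)
        denom = math.factorial(M - n) * math.factorial(M + n - 1) * (2 * n - 1)
        taps.append(sign * product / denom)
    return taps


def halfband_filter(M):
    """Full symmetric impulse response h(k), k = -2M+1, ..., 2M-1."""
    h = {0: 0.5}
    for n, value in enumerate(lagrange_halfband_odd_taps(M), start=1):
        h[2 * n - 1] = value
        h[-(2 * n - 1)] = value
    # Even taps other than h(0) are absent from the dict and default to zero.
    return [h.get(k, 0.0) for k in range(-2 * M + 1, 2 * M)]
```

For M = 2 this gives h(±1) = 9/32 and h(±3) = -1/32, and for every M the taps sum to one, so the filter has unit gain at zero frequency.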

These half-band filters can be utilized to generate the interpolating wavelet
decomposition, which can be regarded as a class of auto-correlated shell orthogonal
wavelets such as the Daubechies wavelets [6]. The interpolating wavelet transform
can also be generated by different Lagrange polynomials, such as [26], according to

$$P_{2n-1}(x) = \prod_{m=-M+1,\; m \neq n}^{M} \frac{x - (2m - 1)}{(2n - 1) - (2m - 1)}$$    (16)

The predicted interpolation can be expressed as

$$\Gamma S_j(i) = \sum_{n=1}^{M} P_{2n-1}(0)\, [S_j(i + 2n - 1) + S_j(i - 2n + 1)], \quad i = 2k + 1$$    (17)

where Γ is a projection and the $S_j$ are the jth layer low-pass coefficients. This
projection relation is equivalent to the subband filter response of

$$h(2n - 1) = P_{2n-1}(0)$$    (18)

The above-mentioned interpolating wavelets can be regarded as an extension
of the fundamental Deslauriers-Dubuc iterative subdivision scheme (factorized as
M = 2, while the order of the Lagrange polynomial is D = 2M - 1 = 3) (Figure 6(a)).

It is easy to verify that an increase of the Lagrange polynomial order will
introduce higher regularity in the interpolating functionals (Figure 7(a)). When
$D \to +\infty$, the interpolating functional becomes the π-band-limited Sinc function and its
domain of definition is the real line. The subband filters generated by Lagrange
interpolating functionals satisfy

(1) Interpolation: $h(\omega) + h(\omega + \pi) = 1$

(2) Symmetry: $h(\omega) = h(-\omega)$

(3) Vanishing Moments: $\int_R x^p \phi(x)\, dx = \delta_p$

Donoho outlines a basic subband extension for perfect reconstruction. He
defines the wavelet function as

$$\psi(x) = \phi(2x - 1)$$    (19)

The biorthogonal subband filters can be expressed as

$$\tilde{h}(\omega) = 1, \quad g(\omega) = e^{-i\omega}, \quad \tilde{g}(\omega) = e^{-i\omega}\, \overline{h(\omega + \pi)}$$    (20)

However, the Donoho interpolating wavelets have some drawbacks. Because the
low-pass coefficients are generated by a sampling operation only, the correlation
between low-pass coefficients becomes weaker as the decomposition layer increases. The
interpolating (prediction) error (high-pass coefficients) strongly increases, which
destroys the compact representation of the signal. Additionally, it does not lead to a
Riesz basis for $L^2(R)$ space.

Sweldens has provided, by far, the most efficient and robust scheme [32] for
constructing biorthogonal wavelet filters. His approach is to generate high-order
interpolating Lagrange wavelets with increased regularity. As Figure 4 shows, $P_0$ is
the interpolating prediction process, and $P_1$, the so-called updating filter, makes the
down-sampled low-pass coefficients smoother. If we choose $P_0$ to be the same as $P_1$,
then the new interpolating subband filters can be depicted as

(21)

The newly developed filters $h_1$, $g_1$, $\tilde{h}_1$, and $\tilde{g}_1$ also constitute the biorthogonal
dual pair for perfect reconstruction. Examples of generated biorthogonal lifting
wavelets with different regularity are shown in Figure 6 and Figure 7.
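
A predict-then-update lifting step of this kind can be sketched as follows. The example uses the cubic Lagrange prediction weights (-1, 9, 9, -1)/16 for the prediction filter and the same taps, halved, for the update filter. The halving factor and the periodic boundary handling are assumptions borrowed from the standard interpolating lifting construction, not details stated in the text.

```python
import numpy as np

# Cubic Lagrange (four-point) prediction taps; an illustrative choice of P0.
PREDICT_TAPS = np.array([-1.0, 9.0, 9.0, -1.0]) / 16.0


def _apply(values, taps, offsets):
    """Periodic FIR filter: out[k] = sum_i taps[i] * values[k + offsets[i]]."""
    return sum(t * np.roll(values, -o) for t, o in zip(taps, offsets))


def lifting_forward(signal):
    """Split into even/odd samples, predict the odd ones, then update the even ones."""
    signal = np.asarray(signal, dtype=np.float64)
    if signal.size % 2:
        raise ValueError("signal length must be even")
    even, odd = signal[0::2], signal[1::2]
    # Prediction: odd sample k lies between even samples k and k + 1.
    detail = odd - _apply(even, PREDICT_TAPS, (-1, 0, 1, 2))
    # Update (assumed to be half the prediction taps): even sample k lies
    # between detail coefficients k - 1 and k.
    smooth = even + 0.5 * _apply(detail, PREDICT_TAPS, (-2, -1, 0, 1))
    return smooth, detail


def lifting_inverse(smooth, detail):
    """Undo the update, undo the prediction, then interleave the samples."""
    even = smooth - 0.5 * _apply(detail, PREDICT_TAPS, (-2, -1, 0, 1))
    odd = detail + _apply(even, PREDICT_TAPS, (-1, 0, 1, 2))
    signal = np.empty(even.size + odd.size)
    signal[0::2] = even
    signal[1::2] = odd
    return signal
```

Perfect reconstruction holds by construction, whatever taps are used, because each lifting step is inverted by subtracting exactly what was added. With the halved update the mean of the low-pass coefficients equals the mean of the input.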

B-Spline Lagrange Wavelets 

Lagrange polynomials are natural interpolating expressions. Utilizing a
different expression for the Lagrange polynomials, we can construct other types of
interpolating wavelets.

We define a class of symmetric Lagrange interpolating functional shells as

$$P_M(x) = \prod_{i=-M,\; i \neq 0}^{M} \frac{i - x}{i}$$    (22)

It is easy to verify that this Lagrange shell also satisfies the interpolating condition on
discrete integer points, since

$$P_M(k) = \begin{cases} 1, & k = 0 \\ 0, & \text{otherwise} \end{cases}$$    (23)

However, simply defining the filter response as

$$h(k) = P(k/2)/2, \quad k = -M, \ldots, M$$    (24)

will lead to unstable interpolating wavelets, as shown in Figure 9.

Utilizing a smooth window, which vanishes at the zeros of the Lagrange
polynomial, will lead to more regular interpolating wavelets and equivalent subband
filters (as in Figures 10 and 11). If we select a well-defined B-spline function as the
weight window, then the scaling function (mother wavelet) can be defined as an
interpolating B-Spline Lagrange functional (BSLF)

$$\phi_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)}\, P_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)} \prod_{i=-M,\; i \neq 0}^{M} \frac{i - x}{i}$$    (25)

where N is the B-spline order, and η is the scaling factor to control the window width.
To ensure coincidence of the zeros of the B-spline and the Lagrange polynomial, we
set

$$2M = \eta \times (N + 1)$$    (26)

To preserve the interpolation condition, the B-spline envelope factor M must be an odd
number. It is easy to show that when the B-spline order is N = 4k + 1, η can be any odd
integer (2k + 1). If N is an even integer, then η can only be 2. When N = 4k - 1, we cannot
construct an interpolating shell according to the above definition.

From the interpolation and self-induced scaling of the interpolating wavelets,
it is easy to establish that

$$h(k) = \phi_M(k/2)/2, \quad k = -2M + 1, \ldots, 2M - 1$$    (27)

Gaussian-Lagrange DAF Wavelets
We can also select a distributed approximating functional, the Gaussian-Lagrange
DAF (GLDAF), as our basic scaling function to construct interpolating wavelets.
These are

$$\phi_M(x) = W_\sigma(x)\, P_M(x) = W_\sigma(x) \prod_{i=-M,\; i \neq 0}^{M} \frac{i - x}{i}$$    (28)

where $W_\sigma(x)$ is a window function which is selected to be a Gaussian,

$$W_\sigma(x) = e^{-x^2 / 2\sigma^2}$$    (29)

because it satisfies the minimum frame bound condition in quantum physics. Here σ
is a window width parameter, and $P_M(x)$ is the Lagrange interpolation kernel. The
DAF scaling function has been successfully introduced as the basis for an efficient
and powerful grid method for quantum dynamical propagations [40]. Using the lifting
scheme [32], a wavelet basis is generated. The Gaussian window in our DAF-wavelets
efficiently smooths out the Gibbs oscillations, which plague most
conventional wavelet bases. The following equation shows the connection between
the B-spline function and the Gaussian window [34]:

$$\beta^N(x) \cong \sqrt{\frac{6}{\pi(N + 1)}} \exp\left(-\frac{6x^2}{N + 1}\right)$$    (30)

for large N. As in Figure 12, if we choose the window width to be

$$\sigma = \eta \sqrt{(N + 1)/12}$$    (31)

the Gaussian Lagrange wavelets generated by the lifting scheme will be much like the
B-spline Lagrange wavelets. Usually, the Gaussian Lagrange DAF based wavelets are
smoother and decay more rapidly than B-spline Lagrange wavelets. If we select more
sophisticated window shapes, the Lagrange wavelets can be generalized further. We
shall call these extensions Bell-windowed Lagrange wavelets.


VISUAL GROUP NORMALIZATION 

It is well known that the mathematical theory of wavelet transforms and
associated multiresolution analyses has applications in signal processing and
engineering problems, where appropriate subband filters are the central entities. The
goal of wavelet signal filtering is to preserve meaningful signal components while
efficiently reducing noise components. To this end, we use known magnitude
normalization techniques [28, 29] for the magnitudes of filter coefficients to develop a
new perceptual normalization to account for the human vision response.

From a signal processing point of view, wavelet coefficients can be regarded
as the results of the signal passing through equivalent decomposition filters (EDF).
The responses of the EDF, $LC_{j,m}(\omega)$, are the combination of several recurrent subband
filters at different stages. As shown in Figure 6, the EDF amplitudes of different
sub-blocks are different. Thus the magnitude of the decomposition coefficients in each of
the sub-blocks will not exactly reproduce the true strength of the signal components.
Stated differently, various EDFs are incompatible with each other in the wavelet
transform. To adjust the magnitude of the response in each block, the decomposition
coefficients are re-scaled with respect to a common magnitude standard. Thus EDF
coefficients, $C_{j,m}(k)$, on layer j and block m should be multiplied by a magnitude
scaling factor, $\lambda_{j,m}$, to obtain an adjusted magnitude representation [28]. This factor
can be chosen as the reciprocal of the maximum magnitude of the frequency response
of the equivalent filter on node (j, m)

$$\lambda_{j,m} = \frac{1}{\sup_{\omega \in \Omega} |LC_{j,m}(\omega)|}, \quad \Omega = [0, 2\pi]$$    (32)

This idea was recently extended to Group Normalization (GN) of wavelet packets for
signal processing [29].
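
The magnitude scaling factor can be estimated numerically as the reciprocal of the peak frequency response of the equivalent filter. The sketch below builds an equivalent filter for one path of the decomposition tree by upsampling each stage filter and convolving, then samples the response with an FFT. The helper names, the FFT length, and the use of FIR tap lists as input are assumptions for illustration.

```python
import numpy as np


def upsample(taps, factor):
    """Insert factor - 1 zeros between consecutive filter taps."""
    taps = np.asarray(taps, dtype=np.float64)
    out = np.zeros((taps.size - 1) * factor + 1)
    out[::factor] = taps
    return out


def equivalent_filter(stage_filters):
    """Combine the subband filters met along one tree path into a single FIR filter.

    The filter used at depth j acts on data already downsampled j times, so it is
    upsampled by 2**j before convolution.
    """
    eq = np.array([1.0])
    for depth, taps in enumerate(stage_filters):
        eq = np.convolve(eq, upsample(taps, 2 ** depth))
    return eq


def magnitude_normalization_factor(filter_taps, n_freq=4096):
    """Reciprocal of the peak magnitude response sampled on [0, 2*pi)."""
    response = np.fft.fft(np.asarray(filter_taps, dtype=np.float64), n=n_freq)
    return float(1.0 / np.max(np.abs(response)))
```

Multiplying the coefficients of each sub-block by its own factor brings the peak gain of every equivalent filter to one, which is the common magnitude standard referred to above.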

An image can be regarded as the result of a real object processed by a human
visual system. The latter has essentially many subband filters. The responses of these
human filters to various frequency distributions are not at all uniform. Therefore, an
appropriate normalization of the wavelet coefficients is necessary. Actually, the
human visual system is adaptive and has a variable lens and focus for different visual
environments. Using a just-noticeable distortion profile, we can efficiently remove the
visual redundancy from decomposition coefficients and normalize them with respect
to a standard of perceptual importance. A practical, simple model for perception
efficiency has been presented by Watson, et al. [5] for data compression. This model
is adapted here to construct the "perceptual lossless" response magnitude $Y_{j,m}$ for
normalizing according to the visual response function [39],

$$Y_{j,m} = a \, 10^{\,k \left( \log \frac{2^j f_0 d_m}{R} \right)^2}$$    (33)

where a defines the minimum threshold, k is a constant, R is the Display Visual
Resolution (DVR), $f_0$ is the spatial frequency, and $d_m$ is the directional response
factor. Together with the magnitude normalized factor $\lambda_{j,m}$, this leads to the perceptual
lossless quantization matrix

$$Q_{j,m} = 2\, Y_{j,m}\, \lambda_{j,m}$$    (34)

This treatment provides a simple, human-vision-based threshold technique
[39] for the restoration of the most important perceptual information in an image. For
gray-scale image processing, the luminance (magnitude) of the image pixels is the
principal concern. We refer to the combination of the above mentioned two
normalizations as the Visual Group Normalization (VGN) of wavelet coefficients.

MASKING TECHNIQUE

Masking is essential to many signal-processing algorithms. Appropriate
masking will result in reduction of noise and undesired components. Certainly, it is
very easy to set up masking if the spectral distribution of a signal and its noise is
known. However, in most cases, such prior knowledge is not available. Statistical
properties of the signal and its noise are assumed so that the noise is taken to be
relatively more random than the signal in each subband. Hard logic masking and soft
logic masking techniques are discussed in the following two subsections.
Hard Logic Masking

The Visual Group Normalization method provides an efficient approach for
re-normalizing the wavelet decomposition coefficients so that various subband filters
have appropriate perceptual impulse responses. However, this algorithm alone does
not yield the best SNR in real signal processing. Essentially, various noise and/or
interference components can be embedded in different nodes of the subband
decomposition tree. To achieve SNR-improved reconstruction of the signal and/or
image, a filtering process is needed to reduce the noise and preserve the main signal
information. Noise due to random processes has a comparatively wide-band
distribution over the decomposition tree, whereas mechanical noise can have a
narrow-band distribution over a few specific subband components. Therefore,
time-varying masking techniques are utilized to reduce noise. We discuss a few useful
masking methods in the rest of this subsection.

Single Dead-Zone Threshold Masking
Single dead-zone threshold masking is the simplest masking method. With a
given decomposition tree, a constant threshold $r$ is selected for our magnitude
normalized wavelet decomposition coefficients $NC_{j,m}(k)$. If the absolute value
of $NC_{j,m}(k)$ is greater than the threshold $r$, the original decomposition coefficient
is kept; otherwise it is set to zero. That is,

$$C_{j,m}(k) = \begin{cases} C_{j,m}(k), & |NC_{j,m}(k)| > r \\ 0, & |NC_{j,m}(k)| \le r \end{cases} \tag{35}$$

This approach is similar to the Pass-Band Selection technique of Ref. [43],
used in an FFT framework. However, in the present approach, the decomposition
coefficients are re-scaled using the visual group normalization. Therefore, even with
single zone masking, it is expected that for a given noisy signal the present wavelet
analysis will achieve a better SNR than that of a single-band FFT method.
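The single dead-zone rule can be sketched in a few lines. This is a minimal illustration, assuming the original coefficients and their normalized counterparts for one node are held in two equal-length lists; the function name `dead_zone_mask` is hypothetical.

```python
def dead_zone_mask(coeffs, normalized, r):
    """Keep a coefficient only if its normalized magnitude exceeds r.

    coeffs     : original decomposition coefficients C(k) of one node
    normalized : magnitude-normalized coefficients NC(k), same length
    r          : constant threshold
    """
    if len(coeffs) != len(normalized):
        raise ValueError("coeffs and normalized must have the same length")
    return [c if abs(nc) > r else 0.0 for c, nc in zip(coeffs, normalized)]


# Example: only the first coefficient survives a threshold of 0.5.
masked = dead_zone_mask([4.0, -0.2, 1.0], [2.0, -0.1, 0.5], 0.5)
```

Note that the test is applied to the normalized coefficient while the value retained is the original one, as the text describes.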

Adaptive Node Mean/Variance Threshold Masking
In practice, we hope the threshold $r$ can be adaptively adjusted to the strength
of the noisy environment. Thus the threshold $r$ should be set higher to suppress a
noisier signal, and in general, $r$ should vary as a function of the statistical properties
of the wavelet decomposition coefficients, the simplest and most important of which
are the mean and second variance. These are incorporated in the present work.

We define the mean and second variance of the magnitude of the normalized
coefficients on node $(j,m)$ as

$$\eta_{j,m} = \frac{1}{N_j} \sum_{k=0}^{N_j - 1} |NC_{j,m}(k)| \tag{36}$$

and

$$\sigma_{j,m} = \left[ \frac{1}{N_j} \sum_{k=0}^{N_j - 1} \left( |NC_{j,m}(k)| - \eta_{j,m} \right)^{2} \right]^{1/2}, \tag{37}$$

where $N_j = 2^{-j} N$ and $N$ is the total length of a filter. A masking is then set according to
the following analysis:

(1) Introduce a factor $|\alpha| < 1$, $\alpha \in R$.

(2) Set the corresponding wavelet transform coefficients, $C_{j,m}(k)$, to zero if the
inequality $|NC_{j,m}(k)| < |\eta_{j,m} + \alpha\sigma_{j,m}|$ holds. This implies that the magnitude of the
normalized coefficient $NC_{j,m}(k)$ is less than the mean, $\eta_{j,m}$, within the statistical
deviation of $\alpha\sigma_{j,m}$, and hence that $NC_{j,m}(k)$ is regarded as a noise component.

(3) Retain the corresponding wavelet transform coefficients, $C_{j,m}(k)$, for
reconstruction if they satisfy the inequality $|NC_{j,m}(k)| > |\eta_{j,m} + \alpha\sigma_{j,m}|$. This implies that the
magnitude of $NC_{j,m}(k)$ is greater than the mean, $\eta_{j,m}$, within the statistical deviation of
$\alpha\sigma_{j,m}$. We consider such an $NC_{j,m}(k)$ as a target component. The rules (2) and (3) are
summarized as

$$C_{j,m}(k) = \begin{cases} C_{j,m}(k), & |NC_{j,m}(k)| > |\eta_{j,m} + \alpha\sigma_{j,m}| \\ 0, & |NC_{j,m}(k)| < |\eta_{j,m} + \alpha\sigma_{j,m}| \end{cases} \tag{38}$$
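The node-adaptive rule can be sketched as follows. This is a minimal illustration under stated assumptions: the statistics are the mean and root-mean-square deviation of the coefficient magnitudes on one node, the function names are hypothetical, and the boundary case where a magnitude equals the threshold exactly (which the rules above leave open) is treated here as "retain".

```python
import math


def node_statistics(normalized):
    """Mean and deviation of the magnitudes |NC(k)| on one node."""
    mags = [abs(x) for x in normalized]
    n = len(mags)
    if n == 0:
        raise ValueError("node has no coefficients")
    eta = sum(mags) / n
    sigma = math.sqrt(sum((m - eta) ** 2 for m in mags) / n)
    return eta, sigma


def adaptive_node_mask(coeffs, normalized, alpha):
    """Zero coefficients whose normalized magnitude is below |eta + alpha*sigma|."""
    if not -1.0 < alpha < 1.0:
        raise ValueError("alpha must satisfy |alpha| < 1")
    eta, sigma = node_statistics(normalized)
    threshold = abs(eta + alpha * sigma)
    # Equality is treated as 'retain' (an assumption of this sketch).
    return [c if abs(nc) >= threshold else 0.0 for c, nc in zip(coeffs, normalized)]


# Example: one dominant coefficient on an otherwise flat node.
kept = adaptive_node_mask([10.0, 20.0, 30.0, 40.0], [1.0, 1.0, 1.0, 5.0], 0.5)
```

A larger factor raises the threshold and therefore suppresses more coefficients, which matches the stated aim of setting the threshold higher for noisier signals.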

Adaptive Whole Tree Threshold Masking
For certain applications, it is possible that all $NC_{j,m}(k)$ on a particular node
$(j,m)$ have essentially the same values. The aforementioned adaptive node
mean/variance threshold masking technique becomes invalid in such a case. We use
an adaptive whole-tree threshold masking method for this situation. The basic
procedure is very similar to the adaptive node method, except that the mean and
second variance are calculated for the whole tree $T$ according to

$$\eta = \frac{1}{N} \sum_{(j,m) \in T} \sum_{k=0}^{N_j - 1} |NC_{j,m}(k)| \tag{39}$$

and

$$\sigma = \left[ \frac{1}{N} \sum_{(j,m) \in T} \sum_{k=0}^{N_j - 1} \left( |NC_{j,m}(k)| - \eta \right)^{2} \right]^{1/2}, \qquad (j,m) \in T. \tag{40}$$

The corresponding reconstruction coefficients are selected by the rules

$$C_{j,m}(k) = \begin{cases} C_{j,m}(k), & |NC_{j,m}(k)| > |\eta + \alpha\sigma| \\ 0, & |NC_{j,m}(k)| < |\eta + \alpha\sigma| \end{cases} \tag{41}$$



Constant False Alarm Masking

In certain applications, such as radar signals generated from a given
environment, it is useful to select an alarm threshold, $r$, based on the mean value of
multiple measurements of the background signal. This approach is similar to
background-contrasted signal processing, in which only the differences between the
signal's optimal tree decomposition coefficients, $NC_{j,m}(k)$, and $NC'_{j,m}(k)$, the
background decomposition coefficients of the same tree structure, are used for
background-contrasted signal reconstruction.
Softer Logic Masking

The various maskings discussed above can be regarded as hard logic masking,
which is similar to a bias-estimated dead-zone limiter. Jain [16] has shown that a
non-linear dead-zone limiter can improve the SNR for weak signal detection,

$$\eta_{\beta}(y) = \operatorname{sgn}(y) \left( |y| - \delta \right)_{+}^{\beta}, \qquad -1 \le \beta \le 1, \tag{42}$$

where $\delta$ is a threshold value. Donoho has shown that the $\beta = 1$ case of the above
expression is a nearly optimal estimator for adaptive NMR data smoothing and
de-noising [9]. Independently, two of the present authors (Shi and Bao) in a previous work
[28] have utilized hard logic masking to extract a target from formidable background
noise efficiently.
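The dead-zone limiter discussed above can be sketched as a scalar function. This is a minimal illustration, assuming the usual reading in which the bracketed excess is clipped at zero so that inputs inside the dead zone map to zero; the function name and the default exponent are illustrative choices. With an exponent of 1 it reduces to the familiar soft-threshold rule.

```python
def dead_zone_limiter(y, delta, beta=1.0):
    """Non-linear dead-zone limiter sgn(y) * (|y| - delta)_+ ** beta.

    Inputs with |y| <= delta fall in the dead zone and map to zero
    (a convention assumed here, which also avoids 0 ** beta for beta < 0).
    """
    excess = abs(y) - delta
    if excess <= 0.0:
        return 0.0
    sign = 1.0 if y > 0 else -1.0
    return sign * excess ** beta


# Example: soft thresholding (beta = 1) shrinks large values toward zero
# by delta and removes small ones entirely.
shrunk = [dead_zone_limiter(y, 1.0) for y in (3.0, -3.0, 0.5)]
```

Unlike a hard threshold, the output is continuous at the threshold for positive exponents, which is what limits the edge artifacts discussed in the following paragraphs.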

The various threshold cutoffs of multiband expansion coefficients in hard
logic masking methods are very similar to the cutoff of an FFT expansion. Thus, Gibbs
oscillations associated with FFTs will also occur in the wavelet transform using
hard logic masking. Although hard logic masking methods with appropriate threshold
values do not seriously change the magnitude of a signal after reconstruction, they can
cause considerable edge distortions in a signal due to the interference of additional
high frequency components induced by the cutoff. The higher the threshold value, the
larger the Gibbs oscillation will be. Since image edges are especially important in
visual perception, hard logic masking can only be used for weak noise signal (or
image) processing, such as electrocardiogram (ECG) signal filtering, where relatively
small threshold values are required. In this paper, we use a Soft Logic Masking
(SLM) method. In our SLM approach, a smooth transition band near each masking
threshold is introduced so that any decomposition coefficients which are smaller than
the threshold value will be reduced gradually to zero, rather than being exactly set to
zero. This treatment efficiently suppresses possible Gibbs oscillations and preserves
image edges, and consequently improves the resolution of the reconstructed image.
The SLM method can be expressed as



where $C_{j,m}(k)$ are the decomposition coefficients to be retained in the reconstruction
and the quantity $\overline{NC}_{j,m}(k)$ is defined as

$$\overline{NC}_{j,m}(k) = \frac{|NC_{j,m}(k)|}{\max\{|NC_{j,m}(k)|\}}. \tag{44}$$

The softer logic mapping, $S: [0,1] \to [0,1]$, is a non-linear, monotonically
increasing sigmoid functional. A comparison of the hard and softer logic masking
functionals is depicted in Figure 13.

In 2D image processing, it is often important to preserve the image gradient
along the xy-direction. For this purpose, we modify the aforementioned softer logic
functional to

(45)

where $\zeta$ is a normalized adaptive threshold. For an unknown noise level, a useful
approximation to $\zeta$ is given by

(46)

where $a$ is a scaling factor conveniently chosen as $a = 1/1.349$. The quantity $\gamma_{upper}$ is
an upper frame boundary of the wavelet transform, i.e., the upper boundary singular
value of the wavelet decomposition matrix. Using arguments similar to those given by
Donoho [9], one can show that the above Softer Logic Masking reconstruction is a
nearly optimal approximation in the minimax error sense.

EXPERIMENTAL RESULTS 

To test our new approaches, standard benchmark 512×512 Y-component images
are employed. Generally, the possible noise sources include photoelectric exchange,
photo spots, the error of image communication, etc. The noise causes the visual
perception to generate speckles, blips, ripples, bumps, ringings and aliasing. The noise
distortion not only affects the visual quality of the images, but also degrades the
efficiency of data compression and coding. De-noising and smoothing are extremely
important for image processing.

The traditional image processing techniques can be classified into two kinds: linear
and non-linear. The principal methods of linear processing are local averaging, low-pass
filtering, band-limit filtering and multi-frame averaging. Local averaging and low-pass
filtering only preserve the low band frequency components of the image signal. The
original pixel strength is substituted by an average of its neighboring pixels (within a
square window). The mean error may be improved but the averaging process will blur
the silhouette and finer details of the image. Band-limited filters are utilized to remove
the regularly appearing dot matrix, texture and skew lines. They are useless for noise
whose correlation is weaker. Multi-frame averaging requires that the images be still, and
the noise distribution stationary. These conditions are violated for motion picture images
or for a space (time)-varying noisy background.

The traditional image quality is characterized by a mean square error (MSE),
which possesses the advantage of a simple mathematical structure. For a discrete signal
$\{s(n)\}$ and its approximation $\{\hat{s}(n)\}$, $n = 0, \ldots, N-1$, the MSE can be defined to be

$$\mathrm{MSE} = \frac{1}{N} \sum_{n=0}^{N-1} \left[ \hat{s}(n) - s(n) \right]^{2}. \tag{47}$$

However, the MSE-based evaluation standard (such as
PSNR = log[(255×255)/MSE]) cannot exactly evaluate the image quality if one neglects
the effect of human perception. The minimum MSE rule will cause strong undulations
of the image level and destroy the smooth transition information around the pixels.
Commonly used regularization methods may degrade the image edges and result in visual
blur.
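For reference, the two error measures mentioned above can be computed as follows. This is a minimal sketch assuming flattened pixel sequences of equal length and 8-bit data (peak value 255); the function names are illustrative. The PSNR is written here in its commonly used decibel form, 10·log10(peak²/MSE), which is an assumption of this sketch since the text quotes the logarithm without the scale factor.

```python
import math


def mse(signal, approx):
    """Mean square error between a signal and its approximation."""
    if len(signal) != len(approx) or not signal:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - s) ** 2 for s, a in zip(signal, approx)) / len(signal)


def psnr_db(signal, approx, peak=255.0):
    """Peak signal-to-noise ratio in decibels (conventional 10*log10 form)."""
    err = mse(signal, approx)
    if err == 0.0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(peak * peak / err)


# Example: a flattened 2x2 image and a slightly perturbed copy.
original = [10, 20, 30, 40]
restored = [12, 20, 27, 40]
error = mse(original, restored)
quality = psnr_db(original, restored)
```

Both measures are purely pixel-wise, which is why, as the text notes, they cannot capture perceptual effects such as edge blur.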

Generally, traditional image processing is unsatisfactory because it is defined on the
whole space (time) region and does not localize the space (time)-frequency details of
the signal. New theoretical research shows that non-Gaussian and non-stationary
characteristics are important components in human visual response. Human visual
perception is more sensitive to image edges, which consist of sharp changes of the
neighboring gray scale, because it is essentially adaptive and has variable lens and focuses
for different visual environments. To protect edge information as well as remove noise,
modern image processing techniques are predominantly based on non-linear methods.
Before the smoothing process, the image edges, as well as perceptually sensitive texture,
must be detected. The commonly used non-linear filtering approaches include median
filtering, weighting average, etc. Median filtering uses the median value within the
window instead of the original value of the pixel. This method causes less degradation
for slanted functions or square functions, but suppresses the signal impulses which are
shorter than half of the window length. This will degrade the image quality. The most
serious shortcomings of the weighting average method are that the weighting window is not
adaptive, and large-scale, complicated calculations are required to generate pixel values.
If the window is made wider, more details will be removed.

More efficient human-vision-system-based image processing techniques possess
the advantages of 1) large range de-correlation for convenience of compression and
filtering; 2) high perceptual sensitivity and robustness; and 3) filtering according to human
visual response. Such a technique therefore can enhance the most important visual
information, such as edges, while suppressing large-scale flat regions and background.
In addition, 4) it can be carried out with real-time processing.

The space (time)-scale logarithmic response characteristic of the wavelet
transform is similar to the HVS response. Visual perception is sensitive to narrow band
low-pass components, and is insensitive to wide band high frequency components.
Moreover, from research in neurophysiology and psychophysical studies, the
direction-selective cortex filtering is very much like a 2D-wavelet decomposition. The high-pass
coefficients of the wavelet transform can be regarded as the visible difference predictor
(VDP).

Utilizing the modified wavelet analysis, the Visual Group-Normalized Wavelet
Transform (VGN-WT) presented in this paper, we can correct the drawback that the raw
magnitudes of the transform coefficients do not exactly yield the perceptual strength of
digital images. By use of the soft logic masking technique, the non-linear filtering
provides edge-preservation for images, which removes the haziness encountered with
commonly used filtering techniques.

The first test result is for the so-called "Lena" image, which possesses clear sharp
edges, strong contrast and brightness. The second picture tested is "Barbara". The high
texture components and consequently high frequency edges in the Barbara image create
considerable difficulties for commonly used filtering techniques. A simple low-pass filter
will smooth out the noise but will also degrade the image resolution, while a simple
high-pass filter can enhance the texture edges but will also cause additional distortion.

We choose 2D half-band Lagrange wavelets as the testing analysis tools for image
processing. The four 2D wavelets are shown in Fig. 14.

As shown in Fig. 15(a) and Fig. 16(a) respectively, adding Gaussian random noise
degrades the original Lena and Barbara images. The median filtering (with a 3×3
window) result for Lena is shown in Fig. 15(b). It is edge-blurred with low visual quality;
the speckled noise has been changed into bumps. This phenomenon is even more serious
for the Barbara image, which has many textures and edges. These features always create
severe difficulties for image restoration. As shown in Fig. 15(c) and Fig. 16(c), it is
evident that our perceptual DAF wavelet technique yields better contrast and
edge-preservation results and provides high quality visual restoration performance.

CONCLUSION

This paper discusses the design of interpolating wavelets based on Lagrange
functions, as well as their application in image processing. A class of biorthogonal
Lagrange interpolating wavelets is studied with regard to its application in signal
processing (especially for digital images). The most attractive property of the
interpolating wavelets is that the wavelet multiresolution analysis is realized by discrete
sampling. Thus pre- and post-conditioning processings are not needed for an accurate
wavelet analysis. The wavelet coefficients are obtained from linear combinations of
sample values rather than from integrals, which implies the possibility of using parallel
computation techniques.

Theoretically, our approach is closely related to the finite element technique for
the numerical solution of partial differential equations, the subdivision scheme for
interpolation approximations, multi-grid methods and surface fitting techniques. In this
paper, we generalize the definition of interpolating Lagrange wavelets and produce three
different biorthogonal interpolating Lagrange wavelets, namely Halfband Lagrange
wavelets, B-spline Lagrange wavelets and Gaussian-Lagrange DAF wavelets.

Halfband Lagrange wavelets can be regarded as an extension of Dubuc
interpolating functionals, auto-correlation shell wavelet analysis and halfband filters.
B-spline Lagrange wavelets are generated by B-spline windowing of a Lagrange functional,
and lead to increased smoothness and localization compared to the basic Lagrange
wavelets.

Lagrange Distributed Approximating Functionals (LDAF) can be regarded as
scaling functions (wavelet-DAFs) and associated DAF-wavelets can be generated in a
number of ways [41]. Both DAFs and DAF-wavelets decay smoothly in both time
and frequency representations. The present work extends the DAF approach to signal and
image processing by constructing new biorthogonal DAF-wavelets and associated
DAF-filters using a lifting scheme [32].

In the first part of our image processing application, we combine two important
techniques, the coefficient normalization method and softer logic visual masking based
on Human Vision Systems (HVS). The resulting combined technique is called Visual
Group Normalization (VGN) processing [31]. The concept of Visual Loss-less
Quantization (VLQ) presented in [39] can lead to a potential breakthrough compared to
the traditional Shannon rate-distortion theory in information processing.

We also employ a modified version of Donoho's Soft Threshold method for image
restoration, termed the Softer Logic Perceptual Masking (SLM) technique, for dealing
with extremely noisy backgrounds. This technique better preserves the important visual
edges and contrast transition portions of an image than the traditional Donoho method
and is readily adaptable to human vision.

Computational results show that our Lagrange wavelet based VGN processing is
extremely efficient and robust for digital image blind restoration and yields the best
performance of which we are aware, when applied to standard Lena and Barbara images.

IMAGE ENHANCEMENT NORMALIZATION
INTRODUCTION

Medical imaging, such as computed tomography (CT), magnetic resonance imaging
(MRI), X-ray mammography, ultrasound and angiography, is one of the major methods for
field diagnosis. In particular, the X-ray mammogram is widely recognized as being the only
effective method for the early detection of breast cancer. Major advances in screen/film
mammograms have been made over the past decade, which have resulted in significant
improvements in image resolution and film contrast without much increase in X-ray dose.
In fact, mammogram films have the highest resolution in comparison to various other
screen/film techniques. However, many breast cancers cannot be detected just based on
mammogram images because of the poor visualization quality of the image. This is due to
the minor difference in X-ray attenuation between normal glandular tissues and
malignant disease, which leads to the low-contrast feature of the image. As a result, early
detection of small tumors, particularly for younger women who have denser breast tissue,
is still extremely difficult.

Mammogram image processing has drawn a great deal of attention in the past
decades [1-9]. Most work focuses on noise reduction and feature enhancement. The
statistical noise level is relatively low for images obtained by modern data acquisition
techniques. Therefore feature enhancement is more essential for mammogram quality
improvement. Since an image noise reduction algorithm was reported earlier [10], we
shall focus on image enhancement in this work. In our opinion, there are two basic ideas
for mammogram feature improvement. One type of method changes the image spectrum
distribution so as to increase image edge densities. As a result, the image appears sharper.
The other approach is the so-called image windowing, in which linear or nonlinear
grayscale mappings are defined so as to increase or compress the image gradient density
according to the device.

In an early work, one of the present authors realized that the frequency responses of
wavelet transform subband filters are not uniform. A magnitude normalization algorithm
was proposed to account for this non-uniform feature [11, 12]. This idea was later extended
to group normalization for the wavelet packet transform [13] and visual group
normalization for still image noise reduction [10] by the present authors. In this work we
further extend this idea to achieve image enhancement.
Our first approach is based on a rearrangement of the image spectrum distribution by
edge enhancement normalization of wavelet coefficients. Our second technique utilizes
a multiscale functional to obtain a device adapted visual group normalization of wavelet
coefficients. Numerical experiments indicate that our normalization approach provides
excellent enhancement for low quality mammogram images in combination with the use
of the biorthogonal interpolating wavelets [10] generated by the Gaussian Lagrange
distributed approximating functional [14].

EDGE ENHANCEMENT NORMALIZATION 

Mallat and Zhong realized that wavelet multiresolution analysis provides a
natural characterization for multiscale image edges, and those manipulations can be
easily achieved by various differentiations [15]. Their idea was extended by Laine et al.
[7] to develop directional edge parameters based on subspace energy measurement. An
enhancement scheme based on complex Daubechies wavelets was proposed by Gagnon
et al. [9]. These authors made use of the difference between the real part and imaginary part
of the wavelet coefficients. One way or another, distorted wavelet transforms are
designed to achieve desired edge enhancement.

Our starting point is magnitude normalized or visual group normalized wavelet
subband coefficients $NC_{j,m}(k)$ [10, 12]. We define an enhancement functional $E_{j,m}$

(1)

where $\Delta$ is the Laplacian and $-1 < \alpha_{j,m}, \beta_{j,m} < 1$. Coefficients $\alpha_{j,m}$, $\beta_{j,m}$ can be easily chosen
so that desired image features are emphasized. In particular, this enables us to emphasize
an image edge of selected grain size. We note that a slight modification of $\alpha_{j,m}$ and $\beta_{j,m}$
can result in orientation selected image enhancement. A detailed discussion of this matter
will be presented elsewhere. An overall re-normalization is conducted after image
reconstruction to preserve the energy of the original image. We call this procedure
enhancement normalization.

DEVICE ADAPTED VISUAL GROUP NORMALIZATION 

Contrast stretching is an old but quite efficient method for feature selective image
display. Nonlinear stretching has been used by many authors [3, 7, 16]. Lu and
coworkers [16] have recently designed a hyperbolic function,
$g_j(k) = [\tanh(ak - b) + \tanh(b)]/[\tanh(a - b) + \tanh(b)]$, for wavelet multiscale gradient
transformation. Their method works well for moon images. The basic idea is to use
gradient operators to shape flat image data so that the desired portion of the image is
projected into a screen window. However, in most approaches, the perceptual response of
the human visual system is not appropriately accounted for. The human visual system is
adaptive and has variable lens and focuses for different visual environments. We propose
a human visual response corrected grayscale gradient mapping technique for selected
contrast enhancement.

Using a just-noticeable distortion profile, one can
efficiently remove the visual redundancy from the decomposition coefficients [17] and
normalize them with respect to the importance of perception. A practical simple model
for perception efficiency has been presented to construct the "perceptual lossless"
response magnitude $Y_{j,m}$ for normalizing according to visual response,

$$Y_{j,m} = a \cdot 10^{\,k \left( \log \frac{2^{j} f_0 d_m}{R} \right)^{2}}, \tag{2}$$

where $a$ defines the minimum threshold, $k$ is a constant, $R$ is the Display Visual
Resolution (DVR), $f_0$ is the spatial frequency, and $d_m$ is the directional response
factor. A perceptual lossless quantization matrix $Q_{j,m}$ is [10]

$$Q_{j,m} = 2\, Y_{j,m} A_{j,m}, \tag{3}$$

where $A_{j,m}$ is a magnitude normalized factor. This treatment provides a simple
human-vision-based threshold technique for the restoration of the most important perceptual
information in an image. For grayscale image contrast stretching, we first appropriately
normalize the decomposition coefficients according to the length scale, $L$, of the display
device [16] so that they fall in the interval [0,1] of the device frame,

$$NC_{j,m}(k) = \frac{Q_{j,m}\, C_{j,m}(k)}{L}. \tag{4}$$

We then use a nonlinear mapping to obtain the desired contrast stretching,

$$\overline{NC}_{j,m}(k) = \gamma_{j,m}\, X_{j,m}\!\left( NC_{j,m}(k) \right), \tag{5}$$

where the constant $\gamma_{j,m}$ and function $X_{j,m}$ are appropriately chosen so that the desired
portion of the grayscale gradient is stretched or compressed. For example, the function

$$X_{j,m}(t) = \frac{\tanh[a_{j,m}(t - b_{j,m})] + \tanh(a_{j,m} b_{j,m})}{\tanh[a_{j,m}(1 - b_{j,m})] + \tanh(a_{j,m} b_{j,m})} \tag{6}$$

can be tuned to stretch any portion of the grayscale gradient. This procedure shall be
called device adapted visual group normalization.
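To make the idea of hyperbolic-tangent contrast stretching concrete, the sketch below implements the mapping attributed in the text to Lu and coworkers, read here as [tanh(a·x − b) + tanh(b)] / [tanh(a − b) + tanh(b)] on the unit interval. The parameter names `a` and `b` follow that reading, the function name is hypothetical, and the sketch is not the device adapted normalization itself.

```python
import math


def tanh_stretch(x, a, b):
    """Map x in [0, 1] to [0, 1] with a tanh-shaped contrast curve.

    a controls the steepness and b shifts the region of strongest
    stretching; a must be non-zero so the denominator does not vanish.
    """
    if a == 0.0:
        raise ValueError("a must be non-zero")
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    numerator = math.tanh(a * x - b) + math.tanh(b)
    denominator = math.tanh(a - b) + math.tanh(b)
    return numerator / denominator


# Example: with a = 4 and b = 2 the curve is steepest around x = 0.5,
# so mid-range values are spread apart and the extremes are compressed.
curve = [tanh_stretch(i / 10.0, 4.0, 2.0) for i in range(11)]
```

The end points are fixed by construction (0 maps to 0 and 1 maps to 1), so the mapping redistributes contrast inside the display range without changing the range itself.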

EXPERIMENTAL RESULT

To test our new approaches, low-contrast and low quality breast mammogram
images are employed. Mammograms are complex in appearance and signs of early
disease are often small or subtle. Digital mammogram image enhancement is particularly
important for aiding radiologists and for the development of automatic detection
expert systems. A typical low quality front-view image is shown in Figure 1(a). The
original image is coded at 512×512 pixel size with 2 bytes/pixel and 12 bits of gray scale.
We have conducted edge enhancement normalization and device adapted visual group
normalization. As shown in Figure 1(b), there is a significant improvement in both edge
representation and image contrast. In particular, the domain and internal structure of
high-density cancer tissues are clearly displayed. Figure 2(a) is an original 1024×1024
side-view breast image which has been digitized to 200 micron pixel edge with 8 bits of gray
scale. The enhanced image result is depicted in Figure 2(b). In this case we obtain a result
similar to the previous one.

CONCLUSION

Edge enhancement normalization (EEN) and device adapted visual group
normalization (DAVGN) are proposed for image enhancement without prior knowledge
of the spatial distribution of the image. Our algorithm is a natural extension of earlier
normalization techniques for image processing. Biorthogonal interpolating distributed
approximating functional wavelets are used for our data representation. Excellent
experimental performance is found for digital mammogram image enhancement.

VARYING WEIGHT TRIMMED MEAN FILTER FOR THE RESTORATION OF
IMPULSE CORRUPTED IMAGES

INTRODUCTION 

Images are often corrupted by impulse noise due to a noisy sensor or channel 
transmission errors. Impulse interference may be broadly defined as corruption which is 
random, sparse, and of high or low amplitude relative to local signal values. Impulse 
noise seriously affects the performance of various signal processing techniques, e.g., edge 
detection, data compression, and pattern recognition. One of the tasks of image 
processing is to restore a high quality image from a corrupted one for use in subsequent 
processing. The goal of image filtering is to suppress the noise while preserving the 
integrity of significant visual information such as textures, edges and details. Nonlinear 
techniques have been found to provide more satisfactory results than linear methods. One 
of the most popular nonlinear filters is the median filter [1-2], which is well-known to 
have the required properties of suppressing the impulse noise and preserving the edges. 
However, it is also true that the median filter is not optimal. It suppresses the true signal 
as well as noise in many applications. Many modifications and generalizations of the 
median filter have been proposed [3-8] for improving its performance. 
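The two properties attributed to the median filter above, impulse suppression and edge preservation, can be seen in a small one-dimensional sketch. It assumes an odd window length and replicates the end samples to handle the borders; both choices, and the function name, are illustrative and not taken from the source.

```python
import statistics


def median_filter_1d(signal, window=3):
    """Sliding-window median with replicated borders (odd window only)."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    if not signal:
        return []
    half = window // 2
    # Replicate the first and last samples so every window is full length.
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    return [statistics.median(padded[i:i + window]) for i in range(len(signal))]


# An isolated impulse is removed, while a step edge passes through unchanged.
impulse_removed = median_filter_1d([0, 0, 9, 0, 0])
edge_kept = median_filter_1d([0, 0, 0, 5, 5, 5])
```

The same example also shows the limitation noted in the text: any genuine signal feature shorter than half the window is removed along with the noise.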

In reference [8], we presented a generalized trimmed mean filter (GTMF), which
is essentially a generalization of the alpha-trimmed mean filter (α-TMF) [5-6]. In GTMF,
a "median basket" is employed to select a predetermined number of pixels on both sides
of the median value of the sorted pixels of the moving window. The luminance values of
the selected pixels and the value of the center pixel in the window are then weighted and
averaged to give the filtering output. As mentioned in [8], it is important to have the
center pixel participate in the averaging operation when removing additive noise, but one
usually sets the weight of the center pixel to zero when filtering impulse-noise-corrupted
images.
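The median-basket construction described above can be sketched as follows. This is a minimal illustration, assuming an odd-sized window given as a flat list, a basket that takes the same number of pixels on each side of the median, and caller-supplied fixed weights; the function and parameter names are hypothetical.

```python
def generalized_trimmed_mean(window, center, half_width, weights, center_weight=0.0):
    """Weighted average of the median basket and, optionally, the center pixel.

    window        : pixel values in the moving window (odd count)
    center        : value of the center pixel of the window
    half_width    : number of basket entries taken on each side of the median
    weights       : one fixed weight per basket entry (2 * half_width + 1 values)
    center_weight : weight of the center pixel (zero for impulse noise)
    """
    ordered = sorted(window)
    m = len(ordered) // 2  # index of the median in the sorted window
    if not 0 <= half_width <= m:
        raise ValueError("basket does not fit inside the window")
    basket = ordered[m - half_width:m + half_width + 1]
    if len(weights) != len(basket):
        raise ValueError("need one weight per basket entry")
    total = center_weight + sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return (center_weight * center + sum(w * v for w, v in zip(weights, basket))) / total


# A 3x3 window whose center pixel (255) is an impulse.
pixels = [10, 12, 11, 255, 13, 0, 12, 11, 12]
median_only = generalized_trimmed_mean(pixels, 255, 0, [1.0])
trimmed = generalized_trimmed_mean(pixels, 255, 1, [1.0, 1.0, 1.0])
```

With a single-entry basket the output is the plain median, and with equal weights and a zero center weight it is the alpha-trimmed mean, so both special cases fall out of the same routine.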

Although the GTMF outperforms many well known methods in restoring highly
impulse noise corrupted images, it can be further modified to improve the filtering
performance. In GTMF, the averaging weights are predetermined and fixed throughout
the filtering procedure. In this paper, a varying weight function is designed and applied
to GTMF. We give it a new name, varying weight trimmed mean filter (VWTMF). In
VWTMF, the argument of the weight function is the absolute difference between the
luminance values in the median basket and the median value. Because we only
concentrate on filtering impulse-noise-corrupted images in this paper, the weight of the
center pixel in the moving window is always assumed to be zero. To remove impulse
noise most effectively, we combine the VWTMF with a switching scheme [9] as an
impulse detector, and an iterative procedure [8] is employed to improve the filtering
performance.
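The varying-weight idea can be sketched by letting each basket entry's weight depend on its distance from the median. This is a minimal illustration under stated assumptions: the distance is divided by the maximum pixel value so that the weight argument lies in [0, 1], the center pixel carries zero weight, and the linear weight used in the example is a hypothetical stand-in, not the weight function of the source. All names are illustrative.

```python
def varying_weight_trimmed_mean(window, half_width, weight_fn, max_value=255.0):
    """Trimmed mean whose weights shrink with distance from the median.

    weight_fn maps a normalized distance in [0, 1] to a non-negative weight
    and is expected to be decreasing with weight_fn(0) > 0.
    """
    ordered = sorted(window)
    m = len(ordered) // 2
    if not 0 <= half_width <= m:
        raise ValueError("basket does not fit inside the window")
    median = ordered[m]
    basket = ordered[m - half_width:m + half_width + 1]
    weights = [weight_fn(abs(v - median) / max_value) for v in basket]
    return sum(w * v for w, v in zip(weights, basket)) / sum(weights)


def linear_weight(x):
    """Hypothetical decreasing weight with w(0) = 1 and w(1) = 0."""
    return 1.0 - x


# Even with every sorted pixel in the basket, the impulse at 255 is
# strongly down-weighted, so the output stays close to the median (12).
pixels = [10, 12, 11, 255, 13, 0, 12, 11, 12]
output = varying_weight_trimmed_mean(pixels, 4, linear_weight)
```

Because the median always receives the largest weight, the denominator cannot vanish as long as the weight function is positive at zero distance.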

Varying Weight Trimmed Mean Filter 

The pixels {/„ 7 2 , . . . , 7 m .„ I m , 7 m+1 , .*.»/„} in the moving window associated 
with a pixel I c9 have been sorted in an ascending (or descending) order in the same way 
as in the conventional median filtering technique, with I m being the median value. The 
key generalization to the median filter which is introduced in the alpha-trimmed mean 
filter (a-TMF) [5,6] is to design a median basket in which to collect a given number of 
pixels above and below the median pixel. The values of these pixels are then averaged 
to give the filtering output, A c9 as an adjusted replacementvalue to I c , according to 



1 



(1) 



where Z=[a/iJ, with (ka^O.5. It is evident that a single-entry median basket (L=0) 
a-TMF is equivalent to the median filter and a /i-entry median basket (I=^[0.5J) is 
equivalent to the simple moving average filter. The generalized trimmed mean filter 
(GTMF) [8] uses a median basket in the same way as is does in the a-TMF to collect the 
pixels. The luminance values in themedian basket and the center pixel I c in the window 
are thenweighted and averaged to give the GTMF output: 

G_c = \frac{w_c I_c + \sum_{j=m-L}^{m+L} w_j I_j}{w_c + \sum_{j=m-L}^{m+L} w_j}    (2)

where G_c is the GTMF output, and w_c and the w_j's are the weights. It is interesting to see
that when w_c = 0 and all w_j's are equal to each other (nonzero), the GTMF becomes the
α-TMF. When all weights except w_m are equal to zero, it becomes the standard median
filter. In the GTMF, the values of the weights are predetermined before filtering and are
fixed during the filtering procedure. However, it is possible to further improve the GTMF
by varying the weights according to the absolute difference between the luminance values
in the median basket and the median value. For the removal of impulse noise, we set
w_c = 0 and modify the GTMF by varying the weights, so the varying weight trimmed mean
filter (VWTMF) is given by




V_c = \frac{\sum_{j=m-L}^{m+L} w(x_j) I_j}{\sum_{j=m-L}^{m+L} w(x_j)}    (3)

where x_j is a value in the range [0,1] defined by

x_j = \frac{|I_j - I_m|}{B}    (4)

with B being the maximum pixel value of a given type of image (e.g., B=255 for an 8-bit
gray-scale image). The weight w(x) in Eq. (3) is a decreasing function in the range [0,1]
and is taken to be the function given in Eq. (5).



The weight function of Eq. (5) for A=2 is displayed in Figure 1. Notice that w(0) = 1
and w(1) = 0; w(x_m) = 1 is always the largest weight, and the greater the difference between
the pixel value and the median value, the smaller the weight. We anticipate that
the VWTMF will outperform both the median filter and the α-TMF in suppressing
impulse noise, while preserving the image edges. As is well known, the median value has
the least probability of being corrupted by impulse noise because the impulses typically
appear near the two ends of the sorted pixels. However, the median value may not be
optimal because it may differ significantly from the noise-free value. The α-TMF will not
perform better than the median filter when treating impulse noise corrupted images in the
absence of any other technique (such as the switching scheme) because the corrupted
pixels may also be selected for the median basket and enter the averaging operation. In contrast,
the VWTMF, which uses a weighted averaging operation, can alleviate the shortcomings
of both filters. The weight of the median value is the largest, and the weights of the other
luminance values in the median basket vary according to their differences from the
median value. If an impulse-corrupted pixel I_j happens to be selected for inclusion in the
median basket, its contribution to the average will be small because x_j is large. In general,
the weight function can assist the VWTMF in eliminating the impulse noise while
providing a well-adjusted replacement value for the center pixel I_c.



w{x) = e~ A (^) . 



(5)

It must be noted that although the present implementation of the VWTMF
employs Eq. (5) as the weight function, other monotonically decaying functions in the
range of [0,1] may also be selected.
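
As a minimal sketch of Eqs. (3) and (4) for a single moving window, the Python fragment below sorts the window, collects the median basket, and forms the weighted average. The names `weight` and `vwtmf` are illustrative only. The weight function used here, exp(-A x^2 / (1 - x^2)), is an assumption chosen solely because it is monotonically decaying on [0,1] with w(0) = 1 and w(1) = 0; it is not asserted to be the exact form of Eq. (5), and any other function with those properties could be substituted.

```python
import math


def weight(x, A=2.0):
    """Assumed decaying weight on [0, 1] with w(0) = 1 and w(1) = 0 (not necessarily Eq. (5))."""
    if x >= 1.0:
        return 0.0
    return math.exp(-A * x * x / (1.0 - x * x))


def vwtmf(window, L=1, A=2.0, B=255.0):
    """VWTMF output V_c of Eq. (3) for one window, given as a flat list of luminance values.

    The center-pixel weight w_c is zero, so the center pixel enters only
    through its position in the sorted list.
    """
    s = sorted(window)
    m = len(s) // 2                    # index of the median value I_m
    basket = s[m - L:m + L + 1]        # median basket with 2L+1 entries
    ws = [weight(abs(v - s[m]) / B, A) for v in basket]   # x_j of Eq. (4)
    return sum(w * v for w, v in zip(ws, basket)) / sum(ws)


# A 3x3 window containing two impulses (0 and 255).
example = [10, 12, 11, 255, 13, 12, 0, 11, 12]
v_small_basket = vwtmf(example, L=1)   # 3-entry basket, stays close to the median 12
v_full_basket = vwtmf(example, L=4)    # whole window; the value 255 is almost ignored
```

Because the median always receives weight 1, the denominator can never vanish. Note that, under the assumed weight, an impulse whose value happens to lie close to the median (here the value 0) is not strongly suppressed, which is one reason the filter is combined with a switching scheme.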



THE SWITCHING SCHEME BASED ITERATIVE METHOD

Many algorithms have been proposed to detect and replace impulse noise
corrupted pixels of an image [9-12]. In this paper, a switching scheme similar to that used
in reference [9], based on the VWTMF, is employed to detect impulse noise and recover
the noise-free pixels. The filtering output I_c is generated according to the following
algorithm



I_c = \begin{cases} I_c^{(0)}, & |I_c^{(0)} - V_c| < T \\ V_c, & |I_c^{(0)} - V_c| \ge T \end{cases}    (6)

where I_c^{(0)} is the initial input value and V_c is given in Eq. (3). A threshold T is chosen
to characterize the absolute difference between I_c^{(0)} and V_c. If the difference is larger
than the threshold, it implies that the pixel differs significantly from its neighbors. It is
therefore identified as an impulse noise corrupted pixel, and is then replaced by V_c. If the
difference is smaller than the threshold, it implies that the original pixel is similar to its
statistical neighbors; we identify it as a noise-free pixel, and it therefore retains its
original value.

Iteration of the above scheme will further improve the filtering performance, 
especially for images that are highly corrupted by impulse noise. The iteration procedure 
can be expressed as 



I_c(t) = \begin{cases} I_c^{(0)}, & |I_c^{(0)} - V_c(t)| < T \\ V_c(t), & |I_c^{(0)} - V_c(t)| \ge T \end{cases}



where I_c(t) is the system output at time t with I_c(0) = I_c^{(0)}, and V_c(t) is the VWTMF
filtering output given by

V_c(t) = \frac{\sum_{j=m-L}^{m+L} w(x_j(t-1)) I_j(t-1)}{\sum_{j=m-L}^{m+L} w(x_j(t-1))}

Note that it is important for the iterative procedure always to compare V_c(t) with
the initial input I_c^{(0)} and to update the output with I_c^{(0)} when their absolute difference
is less than the threshold T.
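
The switching rule and its iteration can be sketched as follows. This is a simplified illustration under stated assumptions, not the implementation used for the reported results: the helper names `window3x3` and `switching_iterate` are hypothetical, borders are handled by clamping coordinates (the text does not specify a border rule), each pass is non-recursive (it reads only the previous pass), and a plain median is used as the default filter so that the fragment is self-contained. Any window filter, such as a VWTMF, can be passed in through `filt`.

```python
from statistics import median


def window3x3(img, r, c):
    """3x3 neighbourhood of pixel (r, c); coordinates are clamped at the border (assumption)."""
    h, w = len(img), len(img[0])
    return [img[min(max(r + dr, 0), h - 1)][min(max(c + dc, 0), w - 1)]
            for dr in (-1, 0, 1) for dc in (-1, 0, 1)]


def switching_iterate(img0, T=28, iterations=3, filt=median):
    """Iterate the switching scheme; every comparison is made against the initial input img0."""
    out = [row[:] for row in img0]
    for _ in range(iterations):
        prev, out = out, []
        for r, row in enumerate(img0):
            new_row = []
            for c, original in enumerate(row):
                v = filt(window3x3(prev, r, c))   # filter the output of the last iteration
                # Keep the initial value when it is close to the filtered value,
                # otherwise treat the pixel as an impulse and replace it.
                new_row.append(original if abs(original - v) < T else v)
            out.append(new_row)
    return out


# A flat patch with one impulse (255) and one mild, legitimate deviation (110).
patch = [[100] * 5 for _ in range(5)]
patch[2][2] = 255
patch[0][0] = 110
restored = switching_iterate(patch)
```

In this example the impulse at the center is replaced, while the pixel with value 110 differs from its filtered value by less than T and therefore keeps its original input value in every iteration.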

NUMERICAL EXPERIMENTS 

The standard 8-bit, gray-scale "Lena" image (size 512x512) is used as an
example to test the usefulness of our filtering algorithm. We degraded it with various
percentages of fixed-value (0 or 255) impulse noise. The proposed algorithm is compared
with the median filtering and α-TMF algorithms, and their peak signal-to-noise ratio
(PSNR) performances are listed in TABLE I.
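
For readers who wish to reproduce a comparable degradation, the sketch below corrupts a chosen fraction of pixels with fixed-value impulses. The function name `add_fixed_impulse_noise`, the equal probability of the two impulse values, and the choice to corrupt exactly the requested fraction of pixels are all assumptions; the text states only that the impulses take the values 0 or 255.

```python
import random


def add_fixed_impulse_noise(img, fraction, low=0, high=255, seed=0):
    """Return a copy of img (list of rows) with `fraction` of its pixels set to low or high."""
    rng = random.Random(seed)          # seeded so that the example is repeatable
    h, w = len(img), len(img[0])
    out = [row[:] for row in img]
    coords = [(r, c) for r in range(h) for c in range(w)]
    for r, c in rng.sample(coords, round(fraction * h * w)):
        out[r][c] = rng.choice((low, high))
    return out


flat = [[128] * 10 for _ in range(10)]
noisy = add_fixed_impulse_noise(flat, 0.40)   # 40% fixed-value impulse noise
```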



TABLE I

Comparative Filtering Results in PSNR for Lena Image
Corrupted with Different Amounts of Fixed Impulse Noise

Algorithm*       15%        20%        25%        30%        35%        40%
Median           32.19 dB   31.48 dB   30.69 dB   29.91 dB   29.37 dB   28.75 dB
α-TMF            32.09 dB   31.30 dB   30.39 dB   29.28 dB   28.38 dB   27.49 dB
VWTMF            32.31 dB   31.67 dB   30.89 dB   30.12 dB   29.63 dB   29.06 dB
Median Switch    35.20 dB   33.87 dB   32.91 dB   31.84 dB   30.99 dB   30.29 dB
α-TMF Switch     36.04 dB   34.93 dB   33.97 dB   32.59 dB   31.67 dB   30.72 dB
VWTMF Switch     36.34 dB   35.13 dB   34.29 dB   33.15 dB   32.24 dB   31.43 dB

* All algorithms are implemented recursively for optimal performance.



Both direct and switch-based applications of the filters are presented in TABLE I for
comparison, and all algorithms are implemented recursively in a 3x3 window for optimal
performance. (The same iterative procedure is used for all the switching scheme based
algorithms.) A 3-entry median basket (L=1) is used for both the α-TMF and VWTMF
algorithms. The A used in the VWTMF weight function is 2 for images with 15%, 20%,
25%, and 30% impulse noise, 3 for 35% impulse noise, and 4.5 for 40%
impulse noise. The threshold used for the switching schemes is 28.

From TABLE I, it is easy to see that without the switching scheme, the α-TMF
performs even worse than the median filter. However, it performs better than the median


WO 99/46731 



PCT/US99/05426 



filter when the switching scheme is used. This reflects the fact that although the α-TMF
may not perform well in impulse noise removal, it is a good impulse detector. The
VWTMF performs better than either the median filter or the α-TMF, whether the
switching scheme is used or not. It is simple, robust and efficient. The VWTMF performs
well in removing impulse noise, and is simultaneously a good impulse detector. It is
especially efficient for filtering images that are highly corrupted by impulse noise. Figure 2 shows
the original noise-free image, the impulse noise corrupted image (40% impulse noise),
and the filtered results for several algorithms. One can observe from Figure 2 that even
if no switching scheme is employed, the VWTMF performs better than either the
median filter or the α-TMF in terms of suppressing noise and preserving edges. The
α-TMF performs even worse than the median filter when no switching scheme is
employed. Many speckles still remain in the α-TMF filtered image. However, the
VWTMF-based switching scheme provides a result that is almost the same as the original
noise-free image.

CONCLUSIONS

This paper presents a new filtering algorithm for removing impulse noise from
corrupted images. It is based on varying the weights of the generalized trimmed mean
filter (GTMF) continuously according to the absolute difference between the luminance
values of selected pixels in the median basket and the median value. Numerical results
show that the VWTMF is robust and efficient for noise removal and as an impulse
detector. Although the VWTMF is only used for removing impulse noise in this paper,
we expect that it also will be useful for removing additive noise.

A NEW NONLINEAR IMAGE FILTERING TECHNIQUE
INTRODUCTION 

Images are often corrupted by noise that seriously affects the performance of
various signal processing techniques, data compression, and pattern recognition. The goal
of noise filtering is to suppress the noise while preserving the integrity of significant
visual information such as textures, edges and details. Linear local averaging filters are
essentially low pass filters. Because the impulse responses of the low pass filters are
spatially invariant, rapidly changing signals, such as image edges and details, cannot be
well preserved. Consequently, impulse noise cannot be effectively removed by linear
methods; nonlinear techniques have been found to provide more satisfactory results.
Some of the most popular nonlinear filters are the median filter [1] and its various
generalizations [2-8], which are well known to have the required properties for edge
preservation and impulse noise removal. However, the median filter is not optimal since
it is typically implemented uniformly across the image. It suppresses the true signal as
well as noise in many applications. In the presence of impulse noise, the median filter
tends to modify pixels that are not degraded. Furthermore, it is prone to produce edge
jitter when the percentage of impulse noise is large. In order to improve the performance
of the median filter, two median based filters, namely the α-trimmed mean (α-TM) [5-6]
filter and the modified trimmed mean (MTM) [6] filter, which select from the window
only the luminance values close to the median value, have been proposed. The selected
pixels are then averaged to provide the filtering output. Although superior to the median
filter in some applications, these algorithms still have problems. In general, the MTM
outperforms the median filter in removing additive noise at the cost of increased
computational complexity, but does not perform as well as the median filter in removing
impulse noise. The α-TM filter is in general superior to the median filter as an impulse
detector, but its performance in removing impulse noise is not as good. As is shown in the
test example of this paper, the α-TM filter performs even worse than the median filter when no
impulse detection techniques are employed. Nevertheless, when the level of impulse
noise is high, the α-TM filter is not optimal for detection, since the selected pixels may
have a large probability of being corrupted by impulse noise. Removal of additive noise
is a problem for the α-TM and MTM filters because they do not take account of the central
pixel in the window.

A new filtering technique is disclosed, using what we call a generalized trimmed
mean (GTM) filter, which in general outperforms the α-TM and MTM filters for images that
are highly corrupted by impulse or additive noise. The GTM filter is based on a
generalization of the α-TM filter. In addition to the commonly used moving spatial
window Φ(m,n), a symmetric median basket is employed to collect a predetermined
number of pixels on both sides of the median value of the sorted pixels in the window.
The luminance values of the collected pixels and the central pixel in the window are
weighted and averaged to obtain an adjusted value G(m,n) for the central pixel (m,n). For
the removal of additive noise, it is very useful to have the central pixel participate in the
averaging operation because the probability that the luminance value of the central pixel
is the closest to the noise-free value is larger than for the other pixels. However, when
impulse noise is present, we usually set the weight of the central pixel to zero because
the impulse-noise-corrupted pixels are independent of the noise-free pixels. For images
that are highly corrupted with impulse noise, it is in general effective to require the
selected values in the median basket to have different weights so that the impulses
selected for the basket do not affect the filtering output very much. Furthermore, in the
procedure of impulse noise removal, the proposed filter is combined with a switching
scheme [9] to produce an impulse detector that preserves the noise-free pixels exactly,
while providing an optimal approximation for the noise-corrupted pixels. Many impulse
detection algorithms have been proposed [9-12]. In [9], a median filter-based switching
impulse detector is employed. The basic idea is to calculate the absolute difference
between the median filtered value and the input value for each pixel. If the difference is
larger than a given threshold, the output is the median filtered value; otherwise, the
output equals the input value. We do the same except that we replace the median filter
with the GTM filter. In order to remove impulse noise effectively, we here propose a new
iteration scheme which generally improves the performance of the filter. We use both the
output of the last iteration and the initial input as the input in the current iteration
calculation. The GTM filter is applied to the output of the last iteration to give an
intermediate output. If the absolute difference between the initial input and the
intermediate output is larger than a predetermined threshold T, the current output is the
intermediate output; otherwise, the current output is the initial input. In contrast to the
traditional iteration technique, our iteration scheme does not use the output of the last
iteration to do the switching operation (the second step).

GENERALIZED TRIMMED MEAN FILTER 

In the commonly used median filtering procedure, the luminance value of a pixel
is replaced by the median value in a neighboring spatial window

\Phi(m,n) = [m-W, m+W] \times [n-W, n+W]    (1)

The size of this moving square window is thus N = (2W+1)x(2W+1). Let the median
luminance value in this spatial window be denoted as

M(m,n) = \mathrm{Median}(\{ I(i,j) \mid I(i,j) \in \Phi(m,n) \})    (2)

where I(i,j) is the luminance value at pixel (i,j). We reorganize the pixels in the window
as a new list according to the ascending order of their luminance values

I'(-\lfloor (N-1)/2 \rfloor) \le \dots \le I'(-1) \le I'(0) \le I'(1) \le \dots \le I'(\lfloor (N-1)/2 \rfloor)    (3)

where I'(0) is the median value in the neighborhood of pixel (m,n), i.e., I'(0) = M(m,n).

The key generalization of median filtering introduced in the alpha-trimmed mean
(α-TM) filter [5-6] is to design a symmetric median basket according to luminance value
in order to combine a given number of pixels on both sides of the median value of the
sorted pixels in the window. The collected pixels are then averaged to give the filtering
output, as follows

A(m,n) = \frac{1}{2L+1} \sum_{j=-L}^{L} I'(j)    (4)

where L = ⌊αN⌋ with 0 ≤ α ≤ 0.5. When α = 0, it becomes the median filter, and when α = 0.5,
it becomes the simple moving average. In general, the α-TM filter outperforms the
median filter in detecting the impulse. However, its capability of removing the impulse
noise is even worse than that of the median filter when no additional procedure is employed
(such as the switching scheme). When an image is corrupted by high levels of impulse
noise, the α-TM filter does not perform well, since the pixels being selected for the
median basket now have a large probability of being impulse corrupted. It is therefore
unreasonable for the α-TM filter to have the pixels in the basket equally weighted.
Another generalization of median filtering is the so-called modified trimmed mean
(MTM) filter [6]. In the MTM filter, a container C(m,n) is employed to select the pixels
whose luminance values are in the range of [M(m,n)-q, M(m,n)+q], with q being a
predetermined threshold. The mean value of the selected values in the container is taken
as the filtering output.

I(m,n) = \mathrm{Mean}(\{ I(i,j) \mid I(i,j) \in C(m,n) \})    (5)
Like the α-TM filter, the MTM filter can also be reduced to the median filter (at q=0)
or the simple moving average (at q=255 for an 8 bpp gray-scale image). The MTM filter is
useful for removing additive noise but does not perform as well as the median filter when
impulse noise removal is required, since impulse noise corrupted pixels are independent
of the noise-free pixels.
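
The two trimmed mean filters just described can be sketched for a single window as follows. The function names are illustrative, the window is assumed to hold an odd number of pixels, and the MTM container is taken to include its end points, which is an assumption about how the range is closed.

```python
def alpha_tm(window, alpha):
    """α-TM output: mean of the 2L+1 sorted values centered on the median, L = floor(alpha * N)."""
    s = sorted(window)
    n = len(s)                     # N, assumed odd
    m = n // 2                     # position of the median I'(0)
    L = int(alpha * n)
    basket = s[m - L:m + L + 1]
    return sum(basket) / len(basket)


def mtm(window, q):
    """MTM output: mean of all window values within q of the median."""
    med = sorted(window)[len(window) // 2]
    container = [v for v in window if abs(v - med) <= q]   # always contains the median
    return sum(container) / len(container)


example = [10, 12, 11, 255, 13, 12, 0, 11, 12]
median_value = alpha_tm(example, 0.0)      # alpha = 0 gives the median
moving_average = alpha_tm(example, 0.5)    # alpha = 0.5 gives the simple moving average
mtm_value = mtm(example, 5)                # both impulses fall outside the container
```

The limiting cases quoted in the text follow directly: `alpha_tm(example, 0.0)` and `mtm(example, 0)` both return the median, while `alpha_tm(example, 0.5)` and `mtm(example, 255)` both return the plain mean of the window.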

There is still one problem left for the α-TM and MTM filters where additive noise
removal is concerned: they do not take special account of the luminance value of the
central pixel in the window. As is well known, the value of the central pixel, in general,
has a larger probability of being the closest to its true value than those of all other pixels
in the window. We therefore design the generalized trimmed mean (GTM) filter, which
improves the performance of both the α-TM and MTM filters. In the first step of the new
filtering process, we employ the median basket to collect those pixels whose luminance
values are close to the median value, in the same way as is done in the α-TM filter. The
difference between our algorithm and the α-TM algorithm is that a weighted averaging
is performed using the values selected for the median basket, as well as the value of the
central pixel. (In general, the central pixel is used in removing additive noise.) Thus,

G(m,n) = \frac{w_c I(m,n) + \sum_{j=-L}^{L} w_j I'(j)}{w_c + \sum_{j=-L}^{L} w_j}    (6)

where w_c and the w_j's are the weights for the central pixel and the pixels in the median basket,
respectively. When w_c = 0 and all w_j's are the same, it becomes the α-TM filter. Where
filtering impulse noise corrupted images is concerned, we usually set w_c = 0 because the
amplitude and the position of the impulse are independent of the original true signal.
Compared with the α-TM filter, the GTM filter is usually more efficient when highly
impulse noise corrupted images are to be filtered. For such an image, the weight w_0 for
the median value is usually chosen to be higher than those of the other pixels in the median
basket because the median pixel usually has the least probability of being corrupted by
impulse noise. Smaller weights for the pixels other than the median pixel can serve as an
adjustment for the filtering output, which is important in the iteration calculation. The
problem with applying the α-TM filter to highly impulse noise corrupted images is that
the weights for the pixels other than the median pixel are too large, which may give
filtering results that are still highly corrupted. As explained above, because the GTM
filter includes the central pixel in the averaging operation, it is reasonable to expect that
its performance in removing additive noise will be improved compared to the α-TM and
MTM filters.
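
A minimal sketch of the weighted average in Eq. (6) for one window is given below. The function name `gtm` and its argument layout are illustrative; the two weight vectors used in the example are the ones quoted in the table footnotes of this paper (1:14:1 with w_c = 0 for impulse noise, 1:1:1 with w_c = 10 for Gaussian noise).

```python
def gtm(window, center, basket_weights, w_c):
    """GTM output of Eq. (6) for one window.

    window         -- flat list of all luminance values in the window (odd length assumed)
    center         -- luminance value of the central pixel I(m,n)
    basket_weights -- weights (w_-L, ..., w_0, ..., w_L) for the sorted median basket
    w_c            -- weight of the central pixel
    """
    s = sorted(window)
    m = len(s) // 2
    L = len(basket_weights) // 2
    basket = s[m - L:m + L + 1]
    numerator = w_c * center + sum(w * v for w, v in zip(basket_weights, basket))
    denominator = w_c + sum(basket_weights)
    return numerator / denominator


example = [10, 12, 11, 255, 13, 12, 0, 11, 12]
# Impulse-noise setting: central pixel excluded, median weighted heavily.
g_impulse = gtm(example, center=255, basket_weights=(1, 14, 1), w_c=0)
# Additive-noise setting: central pixel dominates the average.
g_additive = gtm(example, center=13, basket_weights=(1, 1, 1), w_c=10)
```

With w_c = 0 and equal basket weights the function reduces to the α-TM average, and with all weight on w_0 it reduces to the median, matching the special cases stated above.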

For the removal of impulse noise, a switching scheme [9] based on our GTM
filter is employed to detect the impulse noise corrupted pixels. The filtering output I(m,n)
for a pixel (m,n) is generated by the following algorithm:

I(m,n) = \begin{cases} I_i(m,n), & |I_i(m,n) - G(m,n)| < T \\ G(m,n), & |I_i(m,n) - G(m,n)| \ge T \end{cases}    (7)

where I_i is the original input pixel value and T is the switching threshold used to test the
difference between the original input and the output of the GTM filter.

a) If the difference is larger than T, it implies that the pixel differs significantly
from its neighbors. It is therefore identified as a noise corrupted pixel, and is replaced by
G(m,n).

b) If the difference is smaller than T, it implies that the original pixel is similar to its
statistical neighbors, and we identify it as noise free, therefore retaining its original input
value.

For more seriously impulse-noise-corrupted images, an iterative application of
the above procedure is used to improve the filtering performance. The iteration
processing can be summarized as

I(m,n|t) = \begin{cases} I_i(m,n), & |I_i(m,n) - G(m,n|t)| < T \\ G(m,n|t), & |I_i(m,n) - G(m,n|t)| \ge T \end{cases}    (8)

where I(m,n|t) is the system output at time t, and G(m,n|t) is the intermediate output obtained
by applying Eq. (6) to I(i,j|t-1) with (i,j) ∈ Φ(m,n). To initialize the algorithm, we set
I(i,j|0) = I_i(i,j). The above procedure can be expressed compactly as

I(m,n|t) = I_i(m,n) S(m,n|t) + G(m,n|t) [1 - S(m,n|t)]    (9)

where the step function S(m,n|t) is defined by

S(m,n|t) = \begin{cases} 1, & |I_i(m,n) - G(m,n|t)| < T \\ 0, & |I_i(m,n) - G(m,n|t)| \ge T \end{cases}    (10)

It is important for the iteration procedure always to compare G(m,n|t) with the initial
input I_i(m,n) and to update the filtering output with I_i(m,n) when their difference is less than
the threshold T. It is not advisable to use I(m,n|t-1) instead of I_i(m,n) for either the
comparison or the updating operation.

EXAMPLE APPLICATIONS 

To test the proposed filtering technique, the benchmark 8 bpp gray-scale image,
"Lena", size 512x512, is corrupted with different percentages of fixed-value impulse
noise (40% and 60%), and with Gaussian noise giving a peak signal-to-noise ratio (PSNR)
of 18.82 dB, respectively. The symmetric moving window size is 3x3, with a 3-entry median
basket used for both cases. The PSNR, mean square error (MSE) and mean absolute error
(MAE) comparisons of several different filtering algorithms are shown in TABLE I for
impulse noise and TABLE II for Gaussian noise.
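
The three error measures can be computed as in the sketch below, which assumes the usual definitions for 8-bit images (PSNR = 10 log10(255^2 / MSE)); the function names are illustrative. The tabulated values in this paper are consistent with this definition: for example, an MSE of 86.79 corresponds to a PSNR of about 28.75 dB.

```python
import math


def mse(a, b):
    """Mean square error between two images given as lists of rows."""
    diffs = [(x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)]
    return sum(diffs) / len(diffs)


def mae(a, b):
    """Mean absolute error between two images given as lists of rows."""
    diffs = [abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb)]
    return sum(diffs) / len(diffs)


def psnr_from_mse(mse_value, peak=255.0):
    """Peak signal-to-noise ratio in dB; infinite when the images are identical."""
    if mse_value == 0:
        return math.inf
    return 10.0 * math.log10(peak * peak / mse_value)


def psnr(a, b, peak=255.0):
    return psnr_from_mse(mse(a, b), peak)


reference = [[0, 0], [0, 0]]
filtered = [[0, 0], [0, 10]]
scores = (psnr(reference, filtered), mse(reference, filtered), mae(reference, filtered))
```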



TABLE I.

Comparative Filtering Results for Lena Image with Fixed Impulse Noise

                           Impulse (40%)                    Impulse (60%)
Algorithm*                 PSNR (dB)   MSE        MAE       PSNR (dB)   MSE         MAE
No Denoising               9.29        7659.43    51.03     7.54        11467.21    76.45
Median (3x3)               28.75       86.79      4.72      23.38       298.71      7.25
Median (5x5)               28.15       99.65      5.21      26.15       157.65      6.54
α-TM (3x3)                 27.49       115.86     6.05      23.44       294.41      10.70
Median Switch (3x3) [5]    30.29       60.77      2.54      23.73       275.26      5.50
α-TM Switch (3x3)          30.72       55.15      2.33      26.66       140.3       4.52
GTM Switch (3x3)           31.18       49.55      2.22      27.57       113.84      3.98

*All algorithms are implemented recursively for optimal PSNR and MSE performance. All switching
algorithms are iterated in the same way and the switching threshold T is 28. Three-entry median baskets
are used in the α-TM and the new algorithms. The weights for our algorithm are w_{-1} : w_0 : w_1 : w_c = 1 : 14 : 1 : 0.

TABLE II.

Comparative Filtering Results for Lena Image with Gaussian Noise

Algorithm*       PSNR (dB)   MSE       MAE
No Denoising     18.82       853.07    23.71
Median (3x3)     27.60       112.97    7.72
Median (5x5)     27.39       118.41    7.53
MTM (3x3)        28.23       97.74     7.39
α-TM (3x3)       28.13       100.10    7.33
GTM (3x3)        28.38       94.48     7.04

*All algorithms are implemented recursively for optimal
PSNR and MSE performance. Three-entry median baskets
are used in the last two algorithms. The weights for our
algorithm are w_{-1} : w_0 : w_1 : w_c = 1 : 1 : 1 : 10. The threshold q for
the MTM algorithm is optimized to 100.

The filtering parameters are also shown in the TABLES. All algorithms are implemented
recursively for optimal PSNR and MSE performance. The new iteration scheme proposed
in this paper also has been implemented with the α-TM filter for filtering of impulse noise.
It is evident from TABLE I that our GTM filter-based switching scheme yields improved
results compared to both the median and α-TM filter-based switching schemes. In
general, the α-TM filter-based switching scheme performs better than the median
filter-based switching scheme. However, without the switching scheme, the α-TM filter
performs even worse than the median filter for such highly impulse-noise-corrupted
images, showing that although the α-TM filter is a good impulse detector, it is not good
at removing impulse noise by itself. As shown in Figure 1(a), the original Lena image is
degraded by adding 60% impulse noise. Figure 1(b) is the filtering result using the
method of Sun and Neuvo [9] with a 3x3 window and switching threshold of T=28. One
can observe both blur and speckles on the image; the 3x3 window is not suitable for
applying the median filtering algorithms to such a high-noise image. These speckles can
be removed by increasing the window size from 3x3 to 5x5 but at the expense of even
more blurring after many iterations. Figure 1(c) shows our filtering result with the same
switching threshold. It is seen that our algorithm yields improved results. Because our
algorithm is implemented in a 3x3 window, it is possible to preserve more detail of the
image than the algorithm of Sun and Neuvo [9] implemented in a 5x5 window with many
iterations. The remaining small number of speckles can be removed by filtering using a
5x5 window once (with the same basket-weight vector) and then continuing to iterate
using the 3x3 window; see Figure 1(d). For the 40% impulse noise corrupted Lena image
in Figure 2(a), the filtering performance is compared with commonly used median filters
having different window sizes. In order to remove the speckles from the image, the
median filters must introduce some blurring and aliasing to the image (Figure 2(b) and
2(c)), while our filtering result is significantly improved (Figure 2(d)) in comparison. For
the filtering of highly impulse-noise-corrupted images, the performance is usually
improved when the median value is given a larger weight than the other pixels in the median
basket. TABLE II shows that it is reasonable to include the luminance value of the
central pixel in the averaging operation for the removal of Gaussian noise. With a
weight of the central pixel larger than those of the other pixels in the median basket,
improved iteration performance can be obtained in removing additive noise.
CONCLUSIONS 

In this paper, we present a new algorithm for image noise removal. A given
number of significant luminance values on both sides of the median value of the sorted
pixels in the neighboring window are bundled with the central pixel and weighted to
obtain a modified luminance estimate (MLE) for the central pixel. For the removal of
impulse noise, a threshold selective-pass technique is employed to determine whether the
central pixel should be replaced by its MLE. A new iterative processing is designed to
improve the performance of our algorithm for highly impulse noise corrupted images.
For effectively removing additive noise, it is useful for the filtering technique to include
the value of the central pixel as one of the values being averaged. Numerical experiments
show that our technique is simple, robust and efficient, and leads to significant
improvement over other well-known methods.

BIOMEDICAL SIGNAL PROCESSING USING A NEW CLASS OF WAVELETS
INTRODUCTION

In general, signal filtering and processing may be regarded as a kind of
approximation problem with noise suppression. According to DAF theory [1], a signal
approximation in DAF space can be expressed as

\hat{g}(x) = \sum_i g(x_i) \delta_\alpha(x - x_i)    (1)



where δ_α(x) is a generalized symmetric Delta functional sequence. We choose it as
a Gauss-modulated interpolating shell, or the so-called distributed approximating
functional (DAF) wavelet. The Hermite-type DAF wavelet is given in the following
equation [2].

The function H_{2n} is the Hermite polynomial of even order, 2n. The qualitative behavior
of one particular Hermite DAF is shown in Figure 1.
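
To make the shape of such a functional concrete, the sketch below evaluates a Hermite DAF numerically. It assumes the form commonly given in the DAF literature, namely a Gaussian of width sigma multiplied by a finite sum over even Hermite polynomials, delta_M(x|sigma) = (1/sigma) exp(-x^2/(2 sigma^2)) sum_{n=0}^{M/2} (-1/4)^n / (sqrt(2 pi) n!) H_{2n}(x/(sqrt(2) sigma)). This expression, the parameter names `M` and `sigma`, and the function names are assumptions made for illustration and are not taken from the text above.

```python
import math


def hermite(n, x):
    """Physicists' Hermite polynomial H_n(x), via H_{k+1} = 2x H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * x
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * x * h - 2.0 * k * h_prev
    return h


def hermite_daf(x, M=8, sigma=1.0):
    """Assumed Hermite DAF of even order M and Gaussian width sigma."""
    z = x / (math.sqrt(2.0) * sigma)
    series = sum((-0.25) ** n / math.factorial(n) * hermite(2 * n, z)
                 for n in range(M // 2 + 1))
    return math.exp(-z * z) * series / (sigma * math.sqrt(2.0 * math.pi))


# Sample the functional on a uniform grid, e.g. for plotting.
grid = [-5.0 + 0.1 * k for k in range(101)]
samples = [hermite_daf(x, M=8, sigma=1.0) for x in grid]
```

Under this assumed form, M = 0 reduces to an ordinary normalized Gaussian, and for every even M the functional is symmetric and integrates to one, which is what allows it to act as a smoothing (scaling-function-like) kernel.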

Different selections of interpolating shells result in different DAFs. Theoretically, 
this kind of functional can be regarded as the smoothing operator or the scaling function 
in wavelet theory. It can be used to generate the corresponding wavelets (differential 
functionals) for signal analysis. The discrete wavelet transform is implemented using 
filterbanks. 

Additionally, the continuous type of DAF wavelet is used to construct DAF
wavelet neural nets. The DAF wavelet neural nets possess a modified form of the
commonly used DAF approximation, given as

\hat{g}(x) = \sum_i w(i) \delta_\alpha(x - x_i)    (3)

The weights w(i) of the nets determine the superposition approximation ĝ(x) to the
original signal g(x) ∈ L²(R). It is easy to show that the weights of the approximation nets,
w(i), are closely related to the DAF sampling coefficients g(x_i). If the observed signal is
limited to an interval I containing a total of N discrete samples, I = {0, 1, ..., N-1}, the
square error of the signal is digitized according to

E_A = \sum_{i \in I} |g(x_i) - \hat{g}(x_i)|^2    (4)

This cost function is commonly used for neural network training in a noise-free
background and is referred to as the minimum mean square error (MMSE) rule.
However, if the observed signal is corrupted by noise, the network produced by MMSE
training causes an unstable reconstruction, because the MMSE recovers the noise
components as well as the signal. MMSE may lead to Gibbs-like undulations in the
signal, which is especially harmful for calculating accurate derivatives. Thus, we present
a novel regularization design of the cost function for network training. It generates
edge-preserved filters and reduces distortion. To achieve this, an additional smooth derivative
term, E_r, is introduced to modify the original cost function. The new cost function is

E = E_A + \lambda E_r

= LteW-iWl 2 + ^^l 2 ^ (5) 

k 

The factor λ introduces a compromise between the order of approximation and
smoothness. To increase the stability of the approximation system further, an additional
constraint in state space is taken to be

% = li^F (6) 
Thus the complete cost function utilized for DAF wavelet neural net training is given by 

-ZW^-^tf+A^fA+^^jgr ( 7) 

MAMMOGRAM ENHANCEMENT 

Medical imaging, including computed tomography (CT), magnetic resonance
imaging (MRI), X-ray mammography, ultrasound and angiography, includes some of the
most useful methods for diagnosis. In particular, X-ray mammograms are widely
recognized as being the most effective method for the early detection of breast cancer.
Major advances in mammograms have been made over the past decade, which have
resulted in significant improvements in image resolution and film contrast without much
increase in X-ray dosage. In fact, mammogram films have the highest resolution in
comparison with various other screen/film techniques. However, many breast cancers
cannot be detected based on mammogram images because of the poor visualization
quality of the image. There are also many false positives that result in substantial stress
to patients and their families. Both types of difficulties are substantially due to the minor
differences in X-ray attenuation between normal glandular tissues, benign formations and
malignant disease, which lead to the low-contrast feature of the image. As a result, early
detection of small tumors, particularly for younger women who have denser breast tissue,
is still extremely difficult.

Mammogram image processing has recently drawn a great deal of attention [3-8].
Most work focuses on noise reduction and feature enhancement. The statistical noise
level is relatively low for images obtained by modern data acquisition techniques.
Therefore feature enhancement is more essential for mammogram quality improvement.
Since an image noise reduction algorithm was reported earlier [9, 10], we shall focus on
image enhancement in this work. In our opinion, there are two basic approaches for
mammogram feature improvement. One type of method changes the image spectrum
distribution so as to increase image edge densities. As a result, the image appears sharper.
The other approach is the so-called image windowing, in which linear or nonlinear
grayscale mappings are defined so as to increase/compress the image gradient density.
In an earlier work, one of the present authors realized that the frequency responses of wavelet
transform subband filters are not uniform. A magnitude normalization algorithm was
proposed to account for this non-uniform feature. This idea was later extended to group
normalization for the wavelet packet transform [11, 12] and to visual group
normalization for still-image noise reduction [13] by the present authors. In this work we
further extend this idea to achieve image enhancement.

Our first approach is based on a rearrangement of the image spectrum distribution
by an edge enhancement normalization of the wavelet coefficients. Our second technique
utilizes a multiscale functional to obtain a device-adapted visual group normalization of
the wavelet coefficients. Numerical experiments indicate that our normalization
approach, in combination with the use of the DAF wavelets [13], provides excellent
enhancement for low quality mammogram images.

To test our new approaches, low-contrast and low quality breast mammogram 
images are employed. Mammograms are complex in appearance and signs of early
disease are often small or subtle. Digital mammogram image enhancement is particularly 
important for aiding radiologists in long distance consultation, image storage and 
retrieval, and for the possible development of automatic detecting expert systems. Figure 
2(a) is an original 1024x1024 side-view breast image which was obtained from the 
Mammographic Image Analysis Society (MIAS) and has been digitized to 8 bits of gray 
scale. The enhanced image result is depicted in Figure 2(b), and shows significant 
improvement in image quality. 

ECG FILTERING 

Automatic diagnosis of electrocardiogram (ECG or EKG) signals is an important
biomedical analysis tool. The diagnosis is based on the detection of abnormalities in an
ECG signal. ECG signal processing is a crucial step for obtaining a noise-free signal and
for improving diagnostic accuracy. A typical raw ECG signal is given in Figure 3. The
letters P, Q, R, S, T and U label the medically interesting features. For example, in the
normal sinus rhythm of a 12-lead ECG, a QRS peak follows each P wave. Normal P
waves show 60-100 bpm with <10% variations. Their heights are <2.5 mm and widths
<0.11 s in lead II. A normal PR interval ranges from 0.12 to 0.20 s (3-5 small squares). A
normal QRS complex has a duration of <0.12 s (3 small squares). A corrected QT interval
(QTc) is obtained by dividing the QT interval by the square root of the preceding R-R
interval (normally QTc = 0.42 s). A normal ST segment indicates no elevation or
depression. Hyperkalemia, hyperacute myocardial infarction and left bundle branch block
can cause an extra tall T wave. Small, flattened or inverted T waves are usually caused by
ischaemia, age, race, hyperventilation, anxiety, drinking iced water, LVH, drugs (e.g.
digoxin), pericarditis, PE, intraventricular conduction delay (e.g. RBBB) and electrolyte
disturbance [20].
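
The QTc rule stated above (the QT interval divided by the square root of the preceding
R-R interval) can be illustrated with a minimal sketch. The function name corrected_qt
and the sample values are illustrative assumptions and do not come from the source;
both intervals are assumed to be in seconds.

```python
import math


def corrected_qt(qt_s: float, rr_s: float) -> float:
    """Return QT / sqrt(RR), with both intervals in seconds (hypothetical helper)."""
    if rr_s <= 0:
        raise ValueError("R-R interval must be positive")
    return qt_s / math.sqrt(rr_s)


# Illustrative values only: at 60 bpm the R-R interval is 1.0 s,
# so the corrected interval equals the measured QT interval.
print(corrected_qt(0.42, 1.0))
```

A shorter R-R interval (a faster heart rate) gives a corrected value larger than the
measured QT interval, which is the purpose of the correction.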

An important task of ECG signal filtering is to preserve the true magnitudes of 
the P, Q, R, S, T, and U waves, protect the true intervals (starting and ending points) and 
segments, and suppress distortions induced by noise. The most common noise in an ECG 
signal is AC interference (about 50-60 Hz in the frequency domain). Traditional
filtering methods (low-pass and band-elimination filters, etc.) encounter difficulties in
dealing with the AC noise because the signal and the noise occupy the same band. As a
consequence, experienced doctors are required to carry out time-consuming manual
diagnoses.

Based on a time-varying processing principle, a non-linear filter [14] was recently
adopted for ECG signal de-noising. Similar to the selective averaging schemes used in
image processing, the ECG is divided into different time segments. A sample point
classified as "signal" is smoothed by using short window averaging, while a "noise"
point is treated by using long window averaging. Window width is chosen according to
the statistical mean and variance of each segment. However, this calculation is
complicated and it is not easy to select windows with appropriate lengths. The
regularized spline network and wavelet packets were later used for adaptive ECG
filtering [12, 15], but these are not yet efficient and robust enough for signal processing.
In our present treatment, regularized DAF neural networks are used to handle a real-valued
ECG signal. We utilize our combined group-constraint technique to enhance signal
components and suppress noise in successive time-varying tilings.

The raw ECG of a patient is given in Figure 4(a). Note that it has typical thorn-like
electromagnetic interference. Figure 4(b) is the result of low-pass filter smoothing.
The magnitudes of the P and R waves are significantly reduced and the Q and S waves
almost disappear completely. The T wave is enlarged, which leads to an increase in the
QT interval. Notably, the ST segment is depressed. Such a low-pass filtering result can
cause significant diagnostic errors. Figure 4(c) is obtained by using our DAF wavelet
neural nets. Our method clearly provides better feature-preserving filtering for ECG
signal processing.

VISUAL GROUP NORMALIZATION USING GAUSSIAN-LAGRANGE DAFWs
INTRODUCTION 

Distributed approximating functionals (DAFs) were introduced as a powerful grid
method for quantum dynamical propagations [1]. DAFs can be regarded as scaling
functions, and associated DAF-wavelets can be generated in a number of ways [2].
DAF-wavelets are smooth and decaying in both time and frequency domains and have been
used for numerically solving linear and nonlinear partial differential equations with
extremely high accuracy and computational efficiency. Typical examples include
simulations of 3D reactive quantum scattering and the 2D Navier-Stokes equation with
non-periodic boundary conditions. The present work extends the DAF approach to image
processing by constructing interpolating DAF-wavelets [3]. An earlier group
normalization (GN) technique [4] and the human vision response [5] are utilized to normalize
the equivalent decomposition filters (EDFs) and the perceptual luminance sensitivity. The
combined DAF Visual Group Normalization (VGN) approach achieves robust image
restoration results.

INTERPOLATING DAF WAVELETS 

Interpolating wavelets are particularly efficient for calculation since their 
multiresolution spaces are identical to the discrete sampling spaces. Adaptive boundary 
treatments and irregular samplings can be easily implemented using symmetric 
interpolating solutions. We design the interpolating scaling function as an interpolating 
Gaussian-Lagrange DAF (GLDAF) [6]

$$\phi_M(x) = W_\sigma(x)\,P_M(x) = W_\sigma(x) \prod_{i=-M,\, i \neq 0}^{M} \frac{x - i}{-i} \qquad (1)$$

where $W_\sigma(x)$ is selected as a Gaussian window since it satisfies the
minimum-frame-bound condition in quantum physics. The quantity $\sigma$ is the width
parameter, and $P_M(x)$ is the Lagrange interpolation kernel. DAF dual scaling and wavelet
functions are constructed for perfect reconstruction [3]. The Gaussian window efficiently
smooths out the oscillations that plague many wavelet bases. As shown in Figure 1, both DAF
wavelets and their dual partners display excellent smoothness and rapid decay compared
with the popular B97 wavelets.

VISUAL GROUP NORMALIZATION

The wavelet transform is implemented by a tree-structured filtering iteration. The
coefficients can be regarded as the output of a single equivalent decomposition filter
(EDF). As shown in Figure 2(a) and Figure 2(b), the EDF responses without normalization
differ considerably. Clearly, the decomposition coefficients cannot exactly represent the
actual signal strength. As a correction, the wavelet coefficients $C_m(k)$ in block $m$
should be multiplied by a magnitude scaling factor $\lambda_m$. We define this factor as the
reciprocal of the maximum magnitude of the EDF frequency response [4] (where $LC_m$ is the
$m$-th EDF response):

$$\lambda_m = \frac{1}{\sup_{\omega \in \Omega} \{ |LC_m(\omega)| \}}, \qquad \Omega = [0, 2\pi] \qquad (2)$$

The normalized EDF responses of DAF and B97 wavelets are shown in Figure
2(c) and Figure 2(d), respectively. The DAF wavelets clearly possess smaller
sidelobes, which induce less frequency leakage distortion.
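
The magnitude normalization described above (scaling each block by the reciprocal of
the peak magnitude of its EDF frequency response) can be sketched numerically as
follows. This is only an illustration under stated assumptions: the function names are
hypothetical, the two-tap filters are stand-ins rather than DAF or B97 filters, and the
supremum over [0, 2*pi] is approximated by the maximum over a uniform FFT grid.

```python
import numpy as np


def magnitude_scaling_factor(taps, n_freq=4096):
    """Reciprocal of the peak magnitude of a filter's frequency response.

    `taps` stands in for the impulse response of one equivalent
    decomposition filter (EDF). The response is sampled at n_freq
    uniformly spaced frequencies on [0, 2*pi).
    """
    response = np.fft.fft(np.asarray(taps, dtype=float), n=n_freq)
    return 1.0 / np.max(np.abs(response))


def normalize_block(coeffs, taps):
    """Scale the coefficients of one subband block by its scaling factor."""
    return magnitude_scaling_factor(taps) * np.asarray(coeffs, dtype=float)


# Hypothetical two-tap averaging and differencing filters (not DAF filters).
print(magnitude_scaling_factor([1.0, 1.0]))   # peak response is at omega = 0
print(magnitude_scaling_factor([1.0, -1.0]))  # peak response is at omega = pi
```

After this scaling every block has a peak response of one, so coefficients from
different blocks can be compared against a common threshold.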

An image is a human-vision-dependent source. Using a just-noticeable distortion
profile, we can efficiently remove the visual redundancy as well as normalize the
coefficients with respect to perceptual importance. A mathematical model for the
"perception lossless" matrix $Y_m$ has been presented in [5] and is used as the perceptual
normalization combined with the EDF magnitude normalization. (Note that here we use
$\lambda_m$ for magnitude normalization, not the wavelet "basis function amplitude" in [5],
because the digital image decomposition is performed entirely using filter banks.) We denote
the combination of the two normalization operations as Visual Group Normalization (VGN).

EXPERIMENTAL RESULTS 

To test our approaches, two benchmark 512x512 Y-component images are
employed (Barbara, with much texture and many high-frequency edges, and Lena, with large
flat regions). The PSNR comparison results are shown in Table I, while the
perceptual quality comparison for Lena is shown in Figure 4. It is evident that our
technique yields better performance.

NONLINEAR QUINCUNX FILTERS
INTRODUCTION 

The possible sources of image noise include photoelectric exchange, photo spots, the
error of image communication, etc. The noise causes speckles, blips, ripples,
bumps, ringing and aliasing that not only affect the visual quality, but also
degrade the efficiency of data compression and coding. Developing de-noising
and smoothing techniques is therefore important.

The most widely used noise model is Gaussian, since it is easy to handle with linear
(stationary) analysis. In practice, many physical environments are more
accurately modeled as impulsive, which is characterized by heavy-tailed
non-Gaussian distributions. Such noises include atmospheric noise (due to lightning
spikes and spurious radio emission in radio communication), ice cracking and
aquatic animal activity in sonar and marine communication, and relay switching
noise in telephone channels. Moreover, a great variety of synthetic noise sources
(such as automatic ignitions, neon lights, and other electronic devices) are also
impulsive. Impulsive interference may be broadly defined as signal corruption that
is random, sparse, and of high or low amplitude relative to nearby uncorrupted
signal values. The performance of filter systems developed under the assumption
of Gaussian conditions is severely degraded by non-Gaussian noise due to
large deviations from normality in the tails. Independent of the physical
environment, impulse noise occurs relatively infrequently and is non-stationary,
so we cannot obtain an accurate statistical description of it. Therefore, it is
impossible to design an optimal linear filtering system
based on the generalized likelihood ratio (GLR) principle. Instead, by first
suppressing the impulsive component and then operating on the modified signal
with a traditional linear system, near-optimal performance can be obtained. The
non-Gaussian nature of the impulse noise dictates that the suppression filter be
nonlinear and robust to the presence of impulses. Traditional image processing
is always defined on the whole space (time) region, which does not localize the
space (time)-frequency details of the signal. The mean error may be improved, but
the averaging process will blur the silhouette and finer details.

New research shows that human-visual-response (HVS) based image processing is
characterized by non-Gaussian and non-stationary behavior. Human visual perception
is more sensitive to image edges, which consist of sharp changes in the
neighboring gray scale, because it is essentially adaptive and has variable lenses
and focuses for different visual environments. To protect edge information as well
as remove noise, modern image processing techniques are predominantly based
on non-linear methods. Before the smoothing process, the image edges, as well as
perceptually sensitive texture, must be detected. Some of the most popular
nonlinear solutions are the median filter [1] and its generalizations [2, 3], which
possess the required properties for impulse noise removal. However, the median
filter is not optimal since it is typically implemented uniformly across the image.
It suppresses the true signal as well as the noise in many applications. In the
presence of impulse noise, the median filter tends to modify pixels that are not
degraded by noise. Furthermore, it is prone to produce edge jitter when the
percentage of impulse noise is large.

A significant characteristic of the impulse noise is that only a portion of the 
pixels are corrupted and the rest are totally impulse-noise-free. The key point for 
filter design is the need to preserve the noise-free pixels exactly, while providing 
an optimal approximation for the noise-corrupted pixels. An impulse detector is 
required to determine the noise-corrupted pixels [4-8]. In [4], a median filter- 
based switching scheme is used to design the impulse detector. The basic idea is 
to calculate the absolute difference between the median filtered value and the 
original input value for each pixel. If the difference is larger than a given 
threshold, the output is the median filtered value; otherwise, the output is the 
original input value. However, using a single median filtered value cannot
increase the detection probability of impulse noise. In addition, median estimation
is not optimal for restoring images degraded by a high percentage of impulses. In this
paper, we propose a so-called "median radius" to design a collecting basket. The
"basket members", whose luminance values are close enough to the median value, are
weight-averaged to generate an adjustable impulse detector (optimal estimator).
Note that our technique is not the previously presented α-trimmed solution [2]. For
α-trimmed filtering, the median is calculated first, then the pixels in a neighboring
window are ordered according to their absolute difference from the median value
(from minimum to maximum). The first three (or several) pixels (closest to the median)
are weight-averaged to substitute for the observed pixel luminance. In our method,
we only collect the weighted pixels which are within the "median radius" (an α-trimmed
member may exceed this domain) for averaging. Additionally, a switching
threshold is introduced to determine whether the observed pixel needs filtering or
not. Only noise-corrupted pixels are changed.

To improve the filtering performance, a "quincunx moving window" is
presented to define an arbitrarily shaped neighborhood (the traditional window is the
trivial square shape). Research in neurophysiology and psychophysics
shows that direction-selective cortex filtering mimics the human vision system
(HVS). These studies have found a radial frequency selectivity that is essentially
symmetric with bandwidths nearly constant at one octave [12]. This feature is
much like the 2-D quincunx decomposition. Theoretical analysis and simulation
show that our quincunx filtering technique is quite robust and efficient for
different noise backgrounds. The paper is organized as follows: In Section II, the
mathematical model of noise is described. In Section III, the image processing
technique based on a quincunx filter is studied. Section IV concerns the arbitrary
shape extension of the quincunx filter. A simulation is given in Section V. Section
VI contains the conclusion.

NOISE MODEL 

Additive Random Noise 

For a 2-D digital image, we assume the noise-free image value at pixel
$(m,n)$ is $s(m,n)$, and $I(m,n)$ is the observed image representation. If the observation
model is a random noise corrupted case, the pixel value can be written as

$$I(m,n) = s(m,n) + w(m,n) \qquad (1)$$

where $w(m,n)$ is the noise component, whose magnitude is random. Normally we
assume $w(m,n)$ is a Gaussian process with zero mean and standard deviation $\sigma_w$.
For a non-stationary process, $\sigma_w$ is time-varying. The important characteristic of
such noise is that all pixels may be degraded and the noise amplitude is
statistically non-uniform.

Impulsive Noise

Normally, impulse noise is the result of a random process in which the value of
the corrupted pixel is either the minimum or the maximum value of a display
instrument. The overall noise character can be positive (maximum), negative
(minimum) or a mixture (salt and pepper). The characteristic of this kind of noise
is that the pixels are degraded with probability $p$. The noise model is expressed as

$$I(m,n) = \begin{cases} e(m,n), & \text{with probability } p \\ s(m,n), & \text{with probability } 1-p \end{cases} \qquad (2)$$

where $e(m,n)$ is a binary random number. For an 8-bit gray scale image
representation (0 = minimum; 255 = maximum), its value is

$$e(m,n) = \begin{cases} 0, & \text{with probability } p_0 \\ 255, & \text{with probability } 1-p_0 \end{cases} \qquad (3)$$

The probability of the black (minimum) value is $p_0$ and the probability of the white
(maximum) value is $1-p_0$. Normally, for the positive impulse case, $p_0 = 0$; for the
negative impulse case, $p_0 = 1$; and for salt and pepper, $p_0 = 1/2$. Any other mixture
can be obtained by a different selection of the probability $p_0$.
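
The fixed-value impulse model above can be simulated with a short sketch. It assumes
that pixels are corrupted independently of one another, which the text does not state
explicitly; the function name add_impulse_noise and its parameters are illustrative.

```python
import numpy as np


def add_impulse_noise(image, p, p0=0.5, seed=None):
    """Corrupt each pixel of an 8-bit image with probability p.

    A corrupted pixel becomes 0 (black) with probability p0 and
    255 (white) with probability 1 - p0; all other pixels are unchanged.
    Independence between pixels is an assumption of this sketch.
    """
    rng = np.random.default_rng(seed)
    image = np.asarray(image, dtype=np.uint8)
    corrupted = rng.random(image.shape) < p        # which pixels are hit
    black = rng.random(image.shape) < p0           # polarity of each impulse
    impulses = np.where(black, 0, 255).astype(np.uint8)
    return np.where(corrupted, impulses, image)


# Salt-and-pepper noise (p0 = 1/2) on a flat gray test image.
noisy = add_impulse_noise(np.full((64, 64), 128), p=0.2, seed=0)
print(np.unique(noisy))
```

Setting p0 to 0 gives the positive (white) impulse case and setting it to 1 gives the
negative (black) impulse case, matching the conventions in the text.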

More complex impulsive noise models are generated by a random
magnitude degradation selection. The random-value impulse noise array $v(m,n)$ is
represented as

$$v(m,n) = z(m,n)\,r(m,n) \qquad (4)$$

where the impulse generation $r(m,n)$ is a random process representing an ever-present
impulse component with a given standard deviation, and $z(m,n)$ is a switching sequence of
ones and zeros. Impulse noise seriously affects the performance of various
signal-processing techniques, e.g. edge detection, data compression, and pattern
recognition. A median filter is the commonly used nonlinear technique for impulse noise
removal.

Assume we have fixed a median window (of any size) for median
filtering. Let $N_{m,n}$ denote the number of impulse-corrupted pixels in the median window
$\Phi(m,n)$ of node $(m,n)$. If the window size is $W$ (conventionally the 3x3 or 5x5 square
window), then it is known that the error measure is

£{ w I w = = , pr £(w> 

l ~ Pr l x med -*(«,«)] (5) 



where the median estimation probability $\Pr[x_{med} = s(m,n)] = \Pr[N_{m,n} < (W-1)/2]$
is given in [5]. It is easy to conclude that when the corruption probability $p$ is high,
it is most likely that $N_{m,n} > (W-1)/2$. This implies that the convincing probability (CP)
of the median filtering, $\Pr[x_{med} = s(m,n)]$, is very small (i.e., only with small
probability is the output of a median filter close to the original noise-free value). In
other words, a traditional median filter is ineffective for highly noise-corrupted image
processing. For a uniformly noise-degraded background, no matter what size the selected
window is, statistically the performance of median filtering cannot be optimized (because
the probability $\Pr[N_{m,n} < (W-1)/2]$ is statistically fixed). Our objective is to improve
the performance of nonlinear filtering and reduce the error measure above.
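
To make the dependence of $\Pr[N_{m,n} < (W-1)/2]$ on the corruption probability concrete,
the following sketch evaluates it under an added assumption that is not stated in the
text: each of the $W$ pixels in the window is corrupted independently with probability
$p$, so that $N_{m,n}$ follows a binomial distribution. The function name is hypothetical.

```python
from math import comb


def median_success_probability(window_size: int, p: float) -> float:
    """Pr[N < (W - 1) / 2] for N ~ Binomial(W, p) (independence assumed)."""
    limit = (window_size - 1) / 2
    return sum(
        comb(window_size, k) * p**k * (1 - p) ** (window_size - k)
        for k in range(window_size + 1)
        if k < limit
    )


# 3x3 window (W = 9) at increasing corruption probabilities.
for p in (0.2, 0.4, 0.6):
    print(p, round(median_success_probability(9, p), 4))
```

Under this assumption the probability falls steadily as $p$ grows, which is the
behavior the paragraph above describes for heavily corrupted images.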

NONLINEAR FILTERS 

In the median filtering procedure, the luminance value of a pixel is replaced by
the median value in a neighboring spatial square window

$$\Phi(m,n) = \{[m-m_1,\, m+m_2] \times [n-n_1,\, n+n_2]\} \qquad (6)$$

The size of this moving rectangular window is $N = (m_1+m_2+1) \times (n_1+n_2+1)$. The
conventional trivial windows are 3x3 or 5x5. Let the median luminance value in this
spatial window be denoted as

$$M(m,n) = \mathrm{Median}\{ I(m',n') \mid (m',n') \in \Phi(m,n) \} \qquad (7)$$

where $I(m',n')$ is the luminance value at pixel $(m',n')$. We reorganize the pixels in the
window as a new list according to the order of their luminance values,

$$I'(-\lfloor (N-1)/2 \rfloor) \le \cdots \le I'(0) \le \cdots \le I'(N - \lfloor (N+1)/2 \rfloor) \qquad (8)$$

where $I'(0)$ is exactly the median value in the neighborhood of pixel $(m,n)$,

$$I'(0) = M(m,n) \qquad (9)$$

The key generalization to median filtering introduced in this paper is to design
a "basket" according to luminance value in order to combine a group of pixels whose
luminance levels are close to the median value of the window $\Phi(m,n)$. For each entry in
this basket, a weighted average scheme is utilized to generate an adjusted median value
as

$$D(m,n) = \frac{\sum_{i \in \Omega} w_i\, I'(i)}{\sum_{i \in \Omega} w_i} \qquad (10)$$

where $\Omega$ is the set of pixels whose luminance values are close to the median value in
the window. Our method differs from the α-trimmed filter in that we
introduce an adjustable basket. The median distance (MD) $\Delta$ is defined as the absolute
luminance difference between the observed value and the median filtered value $M(m,n)$:

$$\Delta(m,n) = |I(m,n) - M(m,n)| \qquad (11)$$

Only the pixels whose median distances are within a median radius (MR) $\gamma$ (i.e.,
$\Delta(m,n) \le \gamma$, $\gamma \ge 0$) are selected as "basket members" in $\Omega$ for
weighted averaging. Otherwise, their weights are set to zero. Normally, $w_0$ is larger than
or equal to the other weights $w_i$, $i \neq 0$. We call $D(m,n)$ the modified luminance
estimate (MLE) for pixel $(m,n)$. Note that when the basket $\Omega$ contains only one pixel
$I'(0)$ (or the median radius $\gamma = 0$), the filter is identical to the median filter,
i.e., the median filter is a special case of our presented nonlinear filters.
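
A minimal sketch of the basket average of Equations (10) and (11) for a single window
is given below. It is an illustration only: the function name, the default of uniform
weights, and the restriction to windows with an odd number of pixels (so that the
median is itself a pixel value) are assumptions of the sketch, not details from the
source.

```python
import numpy as np


def modified_luminance_estimate(window, gamma, weights=None):
    """Weighted average of the window pixels within gamma of the median.

    `window` holds the luminance values of one neighborhood (odd count).
    `weights`, if given, holds one positive weight per pixel; uniform
    weights are used otherwise (an assumption of this sketch).
    """
    values = np.asarray(window, dtype=float).ravel()
    if values.size % 2 == 0:
        raise ValueError("window must contain an odd number of pixels")
    median = np.median(values)
    in_basket = np.abs(values - median) <= gamma   # median distance test
    if weights is None:
        w = np.ones_like(values)
    else:
        w = np.asarray(weights, dtype=float).ravel()
    w = np.where(in_basket, w, 0.0)                # zero weight outside basket
    return float(np.sum(w * values) / np.sum(w))


# 3x3 window containing one white and one black impulse.
window = [10, 12, 11, 255, 0, 13, 11, 12, 10]
print(modified_luminance_estimate(window, gamma=2))
print(modified_luminance_estimate(window, gamma=0))  # reduces to the median
```

With a radius of zero only pixels equal to the median remain in the basket, so the
estimate coincides with the ordinary median, in line with the special case noted above.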

However, the modified luminance estimate (MLE) is not the ultimate filtered
value to substitute for the observed pixel. A switching scheme [2] based on the MLE is
employed to detect the impulse noise. The corresponding filtering output $I(m,n)$ for a
pixel $(m,n)$ is generated by the following algorithm:

$$I(m,n) = \begin{cases} I_i(m,n), & |I_i(m,n) - D(m,n)| < T \\ D(m,n), & |I_i(m,n) - D(m,n)| \ge T \end{cases} \qquad (12)$$

where $I_i$ is the original input image. A threshold $T$ is used to test the difference
between the original pixel value and the MLE value. The difference between the observed
value $I_i(m,n)$ and the adjusted median value $D(m,n)$ from the basket can be regarded as
the visible difference predictor (VDP).

a) If the difference is larger than the threshold, it implies that the pixel differs
significantly from its neighbors. It is therefore identified as a noise-corrupted pixel, and
is replaced by $D(m,n)$.

b) If the difference is smaller than the threshold, it implies that the original pixel is
similar to its statistical neighbors, and we identify it as noise-free, therefore retaining
its original value.

For more seriously noise-corrupted images, an iterative application of the
above procedure is required to obtain satisfactory performance. The iteration
can be written as

$$I(m,n|t) = \begin{cases} I_i(m,n), & |I_i(m,n) - D(m,n|t-1)| < T \\ D(m,n|t-1), & |I_i(m,n) - D(m,n|t-1)| \ge T \end{cases} \qquad (13)$$

where $I(m,n|t)$ is the system output at time $t$, and $D(m,n|t-1)$ is the MLE value of pixel
$(m,n)$ at time $t-1$. To initialize the algorithm, we set $I(m,n|0) = I_i(m,n)$. The above
procedure can be simply expressed as

$$I(m,n|t) = I_i(m,n)\,S(m,n|t) + D(m,n|t-1)\,[1 - S(m,n|t)] \qquad (14)$$

where the step function $S(m,n|t)$ is defined as

$$S(m,n|t) = \begin{cases} 1, & |I_i(m,n) - D(m,n|t-1)| < T \\ 0, & |I_i(m,n) - D(m,n|t-1)| \ge T \end{cases} \qquad (15)$$

and it determines which value will be assigned to the pixel.
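
One pass of the switching rule can be sketched as below. The sketch takes the MLE array
as an input and does not show how it is computed; the function name switch_step and
the sample arrays are illustrative assumptions.

```python
import numpy as np


def switch_step(original, estimate, threshold):
    """Apply the switching rule once to whole arrays.

    `original` is the input image I_i and `estimate` is the MLE array
    from the previous iteration; both must have the same shape.
    """
    original = np.asarray(original, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    # Step function: 1 where the pixel is judged noise-free, 0 otherwise.
    s = (np.abs(original - estimate) < threshold).astype(float)
    # Keep the original pixel where s = 1, substitute the estimate where s = 0.
    return original * s + estimate * (1.0 - s)


original = np.array([[100.0, 255.0], [98.0, 0.0]])
estimate = np.array([[101.0, 100.0], [99.0, 100.0]])
print(switch_step(original, estimate, threshold=30.0))
```

In an iterative run the estimate would be recomputed from the current output before
each new pass, while the comparison is always made against the original input image.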

The proposed nonlinear filtering includes two steps: the first step is to obtain the
adjusted reference value (the MLE) of the observed pixel using the median-distance-based
algorithm, and the second step is to use an active switching process to determine whether
the observed pixel is substituted by the MLE or not. (Note that for traditional median
filtering or the α-trimmed solution, the pixel is always substituted by the reference value
without any switching process.) A special nonlinear process is designed to switch
between the original pixel and the MLE, depending upon the identification of the nature of
the pixel. If the pixel is determined to be noise-free, the original value is preserved.
Otherwise, it is replaced by the regressively weighted-median value.

Our filtering is like the Olympic scoring procedure: the lowest and highest
scores (outside the median radius) are removed and the remaining scores are then
averaged to obtain the evaluation of the participant. In our approach, we remove several
of the lowest luminance values and several of the highest luminance values from the
pixels in the window. The remaining luminance values are then weight-averaged to give
the filtering output for the pixel. The weighted average may be viewed as assigning
different referees different weights in scoring the participants.

Theoretically, the MLE generation increases the probability

$$\Pr[x_{med} = s(m,n)] = \Pr[N_{m,n} < (W-1)/2] \qquad (16)$$

When the size of the median basket is set to $M$, this is equivalent to increasing the
median window size $W$ by a factor of $M$ to $MW$, but with the same number of
noise-corrupted pixels $N_{m,n}$. In this case, the detection probability becomes

$$\Pr[x_{med} = s(m,n)] = \Pr[N_{m,n} < (WM-1)/2] \qquad (17)$$

This implies that the modified filtered output is more likely to equal the original
noise-free value.

QUINCUNX EXTENSION 

As shown in Figure 1, the traditional median or averaging filter considers the
pixels in a square region. The most popular sizes are the 3x3 and 5x5 windows, which are
used in most papers and software packages (Photoshop, Lview, etc.) on image
processing. The filtering method or algorithm is applied in the square region (black
pixels). One disadvantage of square window processing is that it cannot adapt to
different noisy backgrounds and exhibits biases along the coordinate directions, or the
spatial distance. This may cause visual distortion.

In fact, besides the basic square window, the median basket (or the operation
region) admits numerous extensions. For example, the 5x5 extension is shown
in Figure 2. The traditional square shape approach gives preferential treatment to the
coordinate axes and only allows for rectangular divisions of the frequency spectrum. The
symmetry axes and certain nonrectangular divisions of the frequency spectrum
correspond better to the human vision system (HVS). These are typically concerned with
two and three dimensions, as the algebraic conditions in higher dimensions become
increasingly cumbersome. The predominant advantages of quincunx filtering are that there
is very little biasing along coordinate directions; the sampling is nearly isotropic, i.e.
there is no mixing of different scales; it provides computationally excellent compression
of discontinuous functions; and it offers a simple representation of operators. In the 5x5
region, the basket size (number of black dots) can be 1, 5, 9, 13, 17, 21, or 25. Thus we
have more alternatives to offset the over-smoothing caused by large-window (5x5)
processing.
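
As an illustration of non-square windows inside a 5x5 region, the sketch below builds
diamond-shaped masks from a city-block distance. This is one possible family only: the
exact window shapes of Figure 2 are not reproduced here, and the function name
diamond_mask is an assumption.

```python
import numpy as np


def diamond_mask(radius: int, size: int = 5) -> np.ndarray:
    """Boolean size x size mask of pixels with |dy| + |dx| <= radius."""
    half = size // 2
    dy, dx = np.mgrid[-half:half + 1, -half:half + 1]
    return (np.abs(dy) + np.abs(dx)) <= radius


# Number of pixels selected for each radius inside the 5x5 region.
for r in range(5):
    print(r, int(diamond_mask(r).sum()))
```

The diamond family yields windows of 1, 5, 13, 21 and 25 pixels; the remaining sizes
listed in the text (9 and 17) require other shapes, such as the 3x3 square for 9.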

Moreover, some specially designed quincunx windows are conducive to removing
particular noise along different spatial directions, such as the quantization and
thresholding noise of transform filtering and coding (discrete cosine transform (DCT) and
wavelet transform (WT), etc.). Because high-dimensional transforms (such as the 2D
DCT and wavelet transform) are usually tensor products of 1D transforms (filtering)
along different spatial directions, a coefficient error in the transform domain causes a
distortion in the physical domain with the shape of the basis function (2D cosine or
wavelet function). It usually looks like a small "cross" impulse. Even for the complicated
non-separable wavelet transform case, the distortion is just a "rotated cross". Some
quincunx windows will match the exact correlation characteristics of transform noise, and
thus obtain the optimal performance. These filters can be utilized as a post-processor
after transform-based image processing and thresholding techniques. In the next
chapter, we will report our quincunx post-processing results for a wavelet thresholding
technique.

EXPERIMENTAL RESULTS 

We now present the results of computer simulations to demonstrate the
effectiveness of our proposed techniques. Two objective criteria, the signal-to-noise ratio
(SNR) and the signal-to-impulse ratio (SIR), are used to evaluate and compare the
performance. The signal-to-noise ratio (Equation (18)) is used to evaluate the overall
performance of the proposed preprocessor, including prediction, nonlinearity and
reconstruction capabilities. Another useful quantity, the
signal-to-impulse ratio (SIR), is given by

s/* = ioi 0 gp|J (19) 

The SNR rule is derived from the traditional image quality criterion, which is
characterized by a mean square error (MSE). It possesses the advantage of a simple
mathematical structure. For a discrete signal $\{s(n)\}$ and its approximation
$\{\hat{s}(n)\}$, $n = 0, \ldots, N$, the MSE can be defined to be

$$\mathrm{MSE} = \frac{1}{N} \sum_{n=0}^{N} [\hat{s}(n) - s(n)]^2 \qquad (20)$$

However, an MSE-based evaluation standard, such as the peak signal-to-noise ratio (PSNR),

$$\mathrm{PSNR} = 10 \log_{10} [(255 \times 255)/\mathrm{MSE}] \qquad (21)$$

cannot exactly evaluate the image quality if one neglects the effect of human perception.
The minimum MSE rule will cause undulations of the image level and destroy the smooth
transition information around the pixels. Therefore, we use both objective and subjective
standards to evaluate our presented filtering results.
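
The objective measures can be computed with a few lines of code. The sketch below
assumes the usual decibel form of the PSNR for 8-bit images (ten times the base-10
logarithm of 255 squared over the MSE) and averages over all samples; the function
names are illustrative.

```python
import numpy as np


def mse(reference, estimate):
    """Mean square error between two arrays of the same shape."""
    diff = np.asarray(reference, dtype=float) - np.asarray(estimate, dtype=float)
    return float(np.mean(diff ** 2))


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in decibels (infinite for identical inputs)."""
    err = mse(reference, estimate)
    if err == 0:
        return float("inf")
    return float(10.0 * np.log10(peak ** 2 / err))


reference = np.zeros(4)
estimate = np.array([0.0, 0.0, 0.0, 2.0])
print(mse(reference, estimate), psnr(reference, estimate))
```

Because both measures depend only on pixel-wise differences, two restorations with the
same PSNR can still differ noticeably in perceived quality, which is why the text also
relies on subjective comparison.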

The benchmark 8 bpp gray-scale images are corrupted by additive impulse noise
and Gaussian noise to test the proposed filtering technique. The practical symmetric
quincunx windows are selected as shown in Figure 3. The peak signal-to-noise ratio
(PSNR), mean square error (MSE) and mean absolute error (MAE) comparisons of
different filtering algorithms for both images are shown in TABLE I (the amount of
noise and the filtering parameters are also shown in TABLE I). It is evident that our
filtering-based switching scheme yields improved results compared to the median
filtering-based switching scheme. In addition, the quincunx extension improves the
filtering performance of the nonlinear processing. When the noise probability is lower
(20%), the 5-point diamond (or cross) quincunx filter gives the better performance. When
the noise probability is higher (60%), the 13-point diamond quincunx filter performs
better. The 9-point square window is suitable for the 40% noise-degraded case. The
perceptual quality of our algorithm is shown in Figure 4. The original Lena image is
degraded by adding 40% impulse noise in Figure 4(a). Figure 4(b) is the filtering result
using our nonlinear filter.

Wavelet noise is a special kind of additive noise generated by coefficient
quantization error or thresholding error. Because the restored image is regarded as a
linear or quincunx combination of different wavelet functions (scalings and translations),
the noise in the physical domain will be "random wavelets". For example, in a
tensor-product wavelet transform case, the wavelet noise has a cross shape. Our quincunx
filter can be utilized as a post-processor for a wavelet de-noising technique. In
Figure 5, Figure 5(a) is the Gaussian noise-degraded Lena. Figure 5(b) is the denoising
result using the DAF wavelet thresholding technique. Figure 5(c) is the one-time nonlinear
quincunx restoration combined with wavelet thresholding. The result has higher visual
quality and a 0.65 dB PSNR improvement.

CONCLUSIONS 

In this paper, we present a nonlinear quincunx filter for impulse or mixed noise
removal. A specially designed "median basket" based on median distance is used to
collect the "basket members" for calculating the modified median estimate (MLE). A
switching scheme is used to detect the impulse noise and preserve the noise-free pixels.
Arbitrary shape quincunx windows are introduced to improve the visual filtering
performance of our nonlinear filter. The quincunx version takes account of the different
correlation structure of the image along different spatial directions, based on the human
vision system (HVS). Iterative processing improves the performance of our algorithm for
highly corrupted images. Numerical simulations show that the quincunx filtering
technique is extremely robust and efficient, and leads to significant improvement in
different noise-degraded cases. The method can be combined with any restoration
technique to constitute a robust restoration method (for example, as the post-processor for
wavelet thresholding techniques). We compared the performance of these techniques,
both subjectively and quantitatively, with the median filter and two of its recently
proposed variants. Special attention was given to the ability of these methods to preserve
the fine image details, such as edges and thin lines. In the experiments, our filtering gave
the best results.

VISUAL MULTIRESOLUTION COLOR IMAGE RESTORATION
INTRODUCTION 

Images are often contaminated by noise. Generally, the possible noise
sources of images are photoelectric exchange, photo spots, imperfection of
communication channel transmission, etc. The noise causes speckles, blips,
ripples, bumps, ringing and aliasing in visual perception. These distortions not
only affect the visual quality of images, but also degrade the efficiency of data
compression and coding. De-noising and restoration are therefore extremely important for
image processing.

The traditional image processing techniques can be classified into two
kinds: linear or non-linear. The principal methods of linear processing are local
averaging, low-pass filtering, band-limit filtering or multi-frame averaging. Local
averaging and low-pass filtering only preserve the low-band components of the
image signal. The original voxel is substituted by an average of its neighboring
voxels (within a square window). The mean error may be improved but the
averaging process blurs the silhouette and details of the image. Band-limited filters
can be utilized to remove the regularly appearing dot matrix, texture and skew
lines. But they are useless for irregularly distributed noise. Multi-frame averaging
requires that the images be still and the noise distribution be stationary, which is not
the case for motion picture images or for a space (time)-varying noisy background.
Generally, traditional image processing is always defined on the whole space
(time) region, which cannot localize the space (time)-frequency details of a signal.
New research evidence shows that non-Gaussian and non-stationary processes are
important characteristics for the human visual response.

Moreover, traditional image quality is evaluated by the mean square error
(MSE), which possesses the advantage of a simple mathematical structure. For a
discrete signal $\{s(n)\}$ and its approximation $\{\hat{s}(n)\}$, $n = 0, \ldots, N-1$, the MSE can be
defined as

$\mathrm{MSE} = \frac{1}{N} \sum_{n=0}^{N-1} [s(n) - \hat{s}(n)]^2$    (1)

However, an MSE-based measure (such as $\mathrm{PSNR} = \log[(255 \times 255)/\mathrm{MSE}]$) does not
exactly evaluate image quality in a way that is consistent with human perception. The
minimum MSE rule may cause serious oscillatory (non-monotonic) behavior in the
convergence process, which will destroy the original smooth transition around the
pixels. Commonly used regularization methods, such as regularized least squares,
may degrade the image details (edges and textures) and result in visual blur.
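
The error measures used in this section can be sketched in a few lines. The following is a minimal illustration, not the authors' evaluation code: the function names are hypothetical, and the PSNR is written in the conventional decibel form 10 log10(255^2/MSE), which is an assumption about the intended logarithm and scaling.

```python
import numpy as np

def mse(reference, estimate):
    """Mean square error between a signal and its approximation."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    return float(np.mean((reference - estimate) ** 2))

def mae(reference, estimate):
    """Mean absolute error between a signal and its approximation."""
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    return float(np.mean(np.abs(reference - estimate)))

def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB (assumed 10*log10 convention)."""
    err = mse(reference, estimate)
    if err == 0.0:
        return float("inf")  # identical signals
    return float(10.0 * np.log10(peak * peak / err))
```

Because all three measures average over every sample equally, they cannot distinguish an error concentrated on an edge from the same error spread over a flat region, which is the limitation discussed above.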

Human visual perception is more sensitive to image edges, which consist
of sharp changes in the neighboring luminance/chrominance scale. Visual
perception is essentially adaptive and has variable lenses and focuses for different
visual environments. To protect as much as possible the detailed information,
while simultaneously removing the noise, modern image processing techniques are
predominantly based on non-linear methods. Commonly, prior to nonlinear
filtering, the image edges, as well as perceptually sensitive textures, are detected
and preserved. The well-known nonlinear filtering approaches include median
filtering and weighted averaging, etc. Median filtering uses the median
luminance/chrominance value of the pixels within the neighboring window to
substitute for the original voxels. This method causes less degradation for the slant
or square functions, but suppresses the signal impulses whose widths are shorter
than half of the window width. Thus it can degrade image quality. To protect the
edges, weighted averaging only smooths neighboring pixels with similar
luminance/chrominance magnitude. However, a serious shortcoming of these
methods is that the width of the weighting window has to be chosen
adaptively, so large-scale, complicated calculations are required to generate the
filtered voxels. If the window selected is too wide, more details may be lost.
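
The behavior of the median filter described above (a step edge survives, while an impulse narrower than half the window is suppressed) can be checked with a small one-dimensional sketch. The function name and the edge-replication padding are assumptions made for illustration only.

```python
import numpy as np

def median_filter_1d(signal, width):
    """Sliding-window median with edge replication (width must be odd)."""
    if width % 2 == 0:
        raise ValueError("width must be odd")
    signal = np.asarray(signal, dtype=float)
    half = width // 2
    padded = np.pad(signal, half, mode="edge")
    return np.array([np.median(padded[i:i + width]) for i in range(len(signal))])

# A step edge is reproduced exactly by a 5-point median ...
step = np.array([0, 0, 0, 0, 9, 9, 9, 9], dtype=float)
# ... while a 2-sample pulse (shorter than half the window) is removed.
pulse = np.array([0, 0, 0, 9, 9, 0, 0, 0], dtype=float)

filtered_step = median_filter_1d(step, 5)
filtered_pulse = median_filter_1d(pulse, 5)
```

The same property that removes impulse noise also removes thin lines and small details, which is why the window width matters so much in practice.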

Human vision system response has attracted much interest for image
processing recently [9, 18, 23, 40, 43, 49, 54]. Using some newly developed non-
linear methods (such as perceptually optimal stack filtering [18]), satisfactory
restored images can be obtained. However, this type of nonlinear filter requires
a thorough knowledge of the ideal image and involves lengthy network
training, which greatly limits its usefulness.

More efficient, human-vision-system-based image processing techniques
possess the advantages of 1) high de-correlation for convenience of compression
and filtering; 2) high numerical stability; 3) consistency with the human visual response,
in that the filtering algorithm enhances perceptually sensitive information while
simultaneously suppressing the non-perceptually-sensitive components; and 4)
robustness together with suitability for real-time processing.

The space (time)-scale logarithmic response characteristic of the wavelet 
transform is quite similar to that of the HVS response. Visual perception is 
sensitive to narrow band, low-frequency components, and insensitive to wide- 
band, high frequency components. This feature can be dealt with using the 
constant-Q analysis of wavelet transforms, which possesses fine resolution in the 
low-band regime, and coarse frequency resolution in the high band regime. The 
recently discovered biological mechanism of the human visual system shows that 
both multiorientation and multiresolution are important features of the human 
visual system. There exist cortical neurons which respond specifically to stimuli 
within certain orientations and frequencies. The visual system has the ability to 
separate signals into different frequency ranges. The evidence in 
neurophysiological and psychophysical studies shows that direction-selective 
cortex filtering is much like a 2D-wavelet decomposition representation. The high- 
pass expansion coefficients of a wavelet transform can be regarded as a kind of 
visible difference predictor (VDP). 

The use of wavelets for the task of image restoration and enhancement is
a relatively new but rapidly emerging approach [3, 26, 27, 29, 38, 40]. Although
there has long been the view that a non-stationary approach may improve results
substantially compared to a stationary one, the idea of multiresolution has not been
a prevalent one. Instead, adaptive restoration techniques have been used to
examine problems in the spatial domain. Various local measures are employed to
describe the signal behavior near a pixel. Because local adaptivity is based
explicitly on the values of the wavelet coefficients, it is very easy to implement
and requires a much smaller effort than the conventional deconvolution filter.

To enhance the performance of wavelet techniques for signal-to-noise-ratio
improvement in signal restoration, it is typical to utilize thresholding techniques 
in multiscale spaces [13, 26, 27, 37, 38]. However, for perceptual image 
processing dependent on the human vision system (HVS), this technique appears 
not to be optimal. As is well known, the HVS model addresses three main 
sensitivity variations, namely, the light (luminance/chrominance) level, spatial 
frequency, and signal content. The visual content of the wavelet coefficients in
various subblocks (representing different spatial frequency bands) is quite different.
Additionally, the contrast level seriously affects the level of a threshold cutoff. 
Simple multi-channel thresholding methods do not yield perfect perception quality 
since they are not fully consistent with HVS response. Furthermore, due to the 
recurrent structure of the subband filters, the coefficient strength in each sub-block 
is also rapidly varying. In addition to these difficulties, the problem of color image 
restoration presents a unique problem in that the multiple color channels are not 
orthogonal. Thus, cross-channel correlation must be exploited in order to achieve 
optimal restoration results. A number of approaches have been used to handle the 
color multichannel image restoration problem [2, 3, 31, 49]. However, the re- 
normalization of the subband response and the HVS has not yet been studied 
thoroughly. 
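
For reference, the conventional thresholding rules mentioned above can be sketched as follows. This is the generic hard and soft thresholding of transform coefficients with a single cutoff, shown only to make the baseline concrete; it is not the visually normalized scheme developed in this paper, and the function names are hypothetical.

```python
import numpy as np

def hard_threshold(coeffs, cutoff):
    """Keep coefficients whose magnitude exceeds the cutoff; zero the rest."""
    coeffs = np.asarray(coeffs, dtype=float)
    return np.where(np.abs(coeffs) > cutoff, coeffs, 0.0)

def soft_threshold(coeffs, cutoff):
    """Shrink every coefficient toward zero by the cutoff."""
    coeffs = np.asarray(coeffs, dtype=float)
    return np.sign(coeffs) * np.maximum(np.abs(coeffs) - cutoff, 0.0)

detail = np.array([-3.0, -0.5, 0.2, 2.0])
kept = hard_threshold(detail, 1.0)    # small coefficients are zeroed
shrunk = soft_threshold(detail, 1.0)  # survivors are also reduced by 1.0
```

A single cutoff applied to raw coefficients ignores both the differing filter gains of the subbands and the differing visual sensitivity of the channels, which motivates the normalizations introduced below.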

The present paper deals with these issues and seeks to develop an efficient
method for noisy color image restoration. We will concentrate on YCrCb channels
because they are relatively de-correlated and can be processed independently.
Magnitude normalization (MN) is utilized to adjust the wavelet transform
coefficients in the different sub-channels. Also, different visual weightings are
employed in the three luminance-chrominance channels (Y, Cr, Cb) to make the
wavelet transform coefficients in these channels better match the vision system
response. These visual weightings have been used before by Watson to
generate a perceptual lossless quantization matrix for image compression [49]. For
non-standard image brightness level (contrast), visual sensitivity normalization
(VSN) is also needed to fix the cutoff threshold. We refer to these three combined
normalization processes as Color Visual Group Normalization (CVGN).

In applications, recently developed interpolating Lagrange wavelets [41,
50] are utilized for color image decomposition and reconstruction. This kind of
interpolating wavelet displays slightly better smoothness and more rapid time-
frequency decay than commonly used wavelets. Moreover, the interpolating
processing enables us to utilize a parallel calculation structure for efficient real-
time implementation. Most importantly, our multiresolution image processing
method is extremely robust in not requiring prior knowledge of either an ideal
image or noise level.

COLOR MODULATION 

Commonly used color models are of three kinds: 

(1) Computer Graphics Color Space 

CMY: Cyan, Magenta and Yellow 

HLS: Hue, Lightness and Saturation 

HSV (HSB): Hue, Saturation and Value (Brightness) 

RGB: Red, Green and Blue 

(2) TV Broadcast Signal Color Space 

YCrCb: Intensity, Color-red and Color-blue 
YIQ: Luminance, In-Phase and Quadrature 

(3) The CIE (Commission Internationale de l'Éclairage) Uniform Color Space

CIEXYZ: Standard primaries X, Y and Z 
CIELab: Luminance, a value and b value 
CIELuv: Luminance, u value and v value 
CIExyY: x-y coordinate and Luminance 

A color vector is usually represented by its three components C=[R, G, B]
of red, green and blue primary signals (Figure 1), with each signal being
represented with 8-bit precision (i.e. an integer range of [0, 255]). In color cathode
ray tube (CRT) monitors and raster graphics devices, the primarily used color
model is red, green, and blue (RGB). It is also by far the most commonly used
model for computer monitors. The Cartesian coordinate system is employed in this
model. The RGB primaries are additive, such that individual contributions of each
primary are added for the creation of a new color. The model is based on the tri-
stimulus theory of vision and is a hardware-oriented model.

This model can be represented by the unit cube which is defined on R, G,
and B axes (Figure 2). The line that runs from Black (0,0,0) to White (1,1,1) is the
gray scale line [56]. The values of R, G and B should be the same (R=G=B) in
order to have an achromatic (colorless) pixel. The RGB color model is additive, so that:

        Red + Green = Yellow
        Red + Blue = Magenta                    (2)
        Green + Blue = Cyan
        Red + Green + Blue = White

The color cube can also be projected as a hexagon, as in Figure 3, with the
lightest point in the middle. This alternative representation is useful for
understanding the relationship to other proposed color models. For example, it
helps to visualize the color transformations between several other color models
(e.g. CMY, HSV, HLS, etc.). With the use of a hexagon, a color RGB cube can be
represented [56].

Let $R_L$, $G_L$, and $B_L$ be values of the three color primaries that are equal to (or
proportional to) the measured luminance of the color primaries. It should be noted
that these primaries cannot be displayed directly because most monitors exhibit
a nonlinear relationship between the input value of the color signal and the
corresponding output luminance. In fact, the nonlinearity for a particular primary
can be approximated by a power law relationship:

$C_L = C^{\gamma}, \quad C = [R, G, B]$    (3)

where $C$ is the input value of the primary, $C_L$ is the luminance of the primary, and
$\gamma$ is a value that usually falls between 2 and 3, depending on the monitor used. The
nonlinearity can also be characterized by more complicated models. In order to
account for this nonlinearity, a transform on the linear color primaries of the form

$R = R_L^{1/\gamma}, \quad G = G_L^{1/\gamma}, \quad B = B_L^{1/\gamma}$    (4)

is typically performed before display on the given monitor. The gamma-corrected
coordinates are also used to characterize many device-independent color spaces.
One such color space is the SMPTE RGB color space, which has been selected as a
television standard. The value of $\gamma$ for this color space is 2.2. All the color images
used in this work are taken to be in SMPTE gamma-corrected RGB coordinates.
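
The power-law relationship and its inverse can be illustrated with a short sketch. It assumes primaries normalized to the range [0, 1] and takes the text's description literally, with luminance equal to the input raised to the power gamma and the correction using the exponent 1/gamma; the function names are hypothetical.

```python
import numpy as np

GAMMA = 2.2  # value quoted in the text for the SMPTE RGB color space

def gamma_correct(linear, gamma=GAMMA):
    """Map linear-light primaries in [0, 1] to gamma-corrected values."""
    linear = np.clip(np.asarray(linear, dtype=float), 0.0, 1.0)
    return linear ** (1.0 / gamma)

def display_luminance(value, gamma=GAMMA):
    """Power-law monitor response: input value in [0, 1] to relative luminance."""
    value = np.clip(np.asarray(value, dtype=float), 0.0, 1.0)
    return value ** gamma

ramp = np.linspace(0.0, 1.0, 11)
round_trip = display_luminance(gamma_correct(ramp))  # should reproduce the ramp
```

Applying the correction before display cancels the monitor nonlinearity, so the displayed luminance is proportional to the intended linear value.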

Since the human vision system (HVS) perceives a color stimulus in terms
of luminance and chrominance attributes, rather than in terms of R, G, B values,
we propose to transform the image to a luminance-chrominance space prior to
performing the quantization. The color representation scheme used in this work
is the CCIR-601-1 eight-bit digital coding standard, comprising a luminance (Y)
and chrominance (Cb/Cr) components, with black at luma code 16 and white at
luma code 235 (instead of the full 8-bit range of 0 to 255). To this end, we pick the
YCrCb component color space that is related to the SMPTE RGB space by a
simple linear transformation. Assuming that R, G, B occupy the range 0-255, the
transform is given by

$Y = 0.298993R + 0.587016G + 0.113991B$
$Cr = 128 + 0.713267(R - Y)$    (5)
$Cb = 128 + 0.564334(B - Y)$,

where the Y, Cr, and Cb values have been scaled to the range 0-255. Henceforth,
3D color vectors will be assumed to be in YCrCb coordinates. Since YCrCb is a
linear transform of a gamma-corrected RGB space, it is also a linear gamma-
corrected space: Y is the gamma-corrected luminance component representing
achromatic colors, the Cr coordinate describes the red-green variation of the color,
and the Cb coordinate describes the yellow-blue variation of the color. The
conversion to YCrCb is one of the key factors enabling us to achieve high image
quality.
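
The linear transform (5) and its inverse can be sketched as follows. The coefficients are those quoted in the text (with the luminance weights read as 0.298993, 0.587016 and 0.113991, which sum to one); the function names and the array layout, with the channel as the last axis, are assumptions for illustration.

```python
import numpy as np

KR, KG, KB = 0.298993, 0.587016, 0.113991  # luminance weights from Eq. (5)
SR, SB = 0.713267, 0.564334                # chrominance scale factors from Eq. (5)

def rgb_to_ycrcb(rgb):
    """Convert gamma-corrected RGB (0-255, last axis = channel) to Y, Cr, Cb."""
    rgb = np.asarray(rgb, dtype=float)
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    y = KR * r + KG * g + KB * b
    cr = 128.0 + SR * (r - y)
    cb = 128.0 + SB * (b - y)
    return np.stack([y, cr, cb], axis=-1)

def ycrcb_to_rgb(ycrcb):
    """Invert rgb_to_ycrcb by solving Eq. (5) for R, B and then G."""
    ycrcb = np.asarray(ycrcb, dtype=float)
    y, cr, cb = ycrcb[..., 0], ycrcb[..., 1], ycrcb[..., 2]
    r = y + (cr - 128.0) / SR
    b = y + (cb - 128.0) / SB
    g = (y - KR * r - KB * b) / KG
    return np.stack([r, g, b], axis=-1)

gray = rgb_to_ycrcb([100.0, 100.0, 100.0])  # achromatic pixel: Cr = Cb = 128
```

An achromatic pixel (R = G = B) maps to Cr = Cb = 128, so all color information is confined to the two chrominance channels, which is what allows the three channels to be processed separately.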

Digital grayscale images typically contain values that represent so-called 
gamma-corrected luminance Y. Here Y is a power function of luminance, with an 
exponent of around 1/2.3. Three gamma-corrected numbers, red, green, and blue
components (RGB), are used to represent each pixel in the case of color images.
In the case of a gray image, each pixel is represented by a single brightness. In this 
treatment, each pixel is transformed from the original color representation (for 
example RGB) to a color representation that consists of one brightness signal and 
two color signals (such as the well-known TV broadcast color space YCbCr). 



Table 1. Display resolution comparison

Display                 Resolution      Distance    DVR
                        (pixels/inch)   (inches)    (pixels/degree)
Computer Display        72              12          15.1
Low Quality Printing    300             12          62.8
High Quality Printing   1200            12          251.4
HDTV                    48              72          60.3

VISUAL SENSITIVITY OF WAVELET COEFFICIENTS 

The visibility of wavelet transform coefficients will depend upon the display
visual resolution [49] in pixels/degree. Given a viewing distance V in inches and a display
resolution d in pixels/inch, the effective display visual resolution (DVR) R in pixels/degree
of visual angle is

$R = dV\tan(\pi/180) \approx dV/57.3$    (6)

A useful measure is that the visual resolution is the viewing distance in pixels (dV)
divided by 57.3. Table 1 provides some illustrative examples. For example, the HDTV
entry assumes 1152 active lines at a viewing distance of 3 picture heights [49].
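
The entries of Table 1 can be reproduced directly from the display resolution and viewing distance. The sketch below assumes the form R = dV tan(pi/180); the function name and the dictionary of displays are introduced here for illustration only.

```python
import math

def display_visual_resolution(pixels_per_inch, distance_inches):
    """Display visual resolution in pixels per degree of visual angle."""
    return pixels_per_inch * distance_inches * math.tan(math.pi / 180.0)

# (resolution in pixels/inch, viewing distance in inches), as listed in Table 1
displays = {
    "Computer Display": (72, 12),
    "Low Quality Printing": (300, 12),
    "High Quality Printing": (1200, 12),
    "HDTV": (48, 72),
}

dvr = {name: round(display_visual_resolution(d, v), 1)
       for name, (d, v) in displays.items()}
```

Rounded to one decimal place, the computed values agree with the DVR column of Table 1.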

On each decomposition level, wavelet coefficients are divided into four
subblocks, LL, HL, LH, and HH. The detailed subblocks HL, LH and HH represent three
different decomposition orientations, horizontal, vertical, and diagonal, respectively. As
the layer increases, the bandwidth of the equivalent subband filters decreases by a factor
of two, and thus the frequency resolution doubles. Correspondingly, the space (time)
resolution (display resolution) decreases by a factor of two. A subblock of wavelet
coefficients corresponds to a spatial frequency band. For a display resolution of R
pixels/degree, the spatial frequency f of level j is

$f = 2^{-j}R$    (7)
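
As a small worked example of the halving described above, the sketch below lists the spatial frequency associated with each decomposition level, assuming the relation f = R / 2^j with level 1 the finest. The function name and the example resolution of 32 pixels/degree are hypothetical.

```python
def subband_frequency(dvr, level):
    """Spatial frequency of decomposition level j, assuming f = R / 2**j."""
    if level < 1:
        raise ValueError("level must be a positive integer")
    return dvr * 2.0 ** (-level)

# Each additional level halves the spatial frequency of the subband.
frequencies = [subband_frequency(32.0, j) for j in range(1, 5)]
```

For a display of 32 pixels/degree this gives 16, 8, 4 and 2 for levels 1 through 4.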

As discussed in section I, the display gamma is 2.3. The R, G, B true color image
is modulated into Y, Cr, Cb channels. The model thresholds for the three color channels are
different from each other. For example, the just-noticeable quantization threshold of Y
is generally about a factor of two below that of Cr, which is in turn about a factor of two
below the Cb curve at each spatial frequency. Note that this difference usually declines
at higher spatial frequency [49]. The Cb curve is somewhat broader than Y or Cr. This
broadening is likely due to the intrusion of a luminance-detecting channel at high
frequencies and high contrasts. Because the Cb color axis is not orthogonal to the human
luminance axis, the Cb colors do have a luminance component.

The contrast sensitivity declines as the spatial frequency increases, that is, as
the size of the stimuli decreases. This model is adapted to construct the "perceptual lossless"
response magnitude for normalizing in subblock (j, m) according to the visual response.

VISUAL GROUP NORMALIZATION 

The main objective of wavelet signal filtering is to preserve important signal 
components, and efficiently reduce noisy components. For perceptual images, it is most 
important to protect the signal components (always represented by the 
luminance/chrominance levels) which are sensitive to human eyes. To achieve this goal, 
we utilize the magnitudes of the filter response and the human vision response. 

Magnitude Normalization 

Wavelet coefficients can be regarded as the output of the signal passing through
equivalent decomposition filters (EDF). The responses of the EDF are the combination
of several recurrent subband filters at different stages [19, 28, 34, 37, 38, 40, 41]. The
EDF amplitudes of various sub-blocks differ greatly. Thus, the magnitude of the
decomposition coefficients in each of the sub-blocks cannot exactly reproduce the actual
strength of the signal components. To adjust the magnitude of the response in each block,
the decomposition coefficients are re-scaled with respect to a common magnitude
standard. Thus EDF coefficients, $C_m(k)$, in block m should be multiplied by a
magnitude scaling factor, $\lambda_m$, to obtain an adjusted magnitude representation. We choose
this factor to be the reciprocal of the maximum magnitude of the frequency response of
the equivalent wavelet decomposition filter on node (j, m).
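
The choice of scaling factor stated in words above can be written compactly as follows. This is a sketch under assumed notation: $H_{j,m}(\omega)$ is a symbol introduced here for the frequency response of the equivalent decomposition filter on node (j, m), $\lambda_{j,m}$ is the magnitude scaling factor, and $NC_{j,m}(k)$ is a hypothetical name for the normalized coefficients.

```latex
\lambda_{j,m} = \frac{1}{\sup_{\omega \in [0, 2\pi)} \left| H_{j,m}(\omega) \right|},
\qquad
NC_{j,m}(k) = \lambda_{j,m} \, C_{j,m}(k)
```

With this choice every equivalent filter has unit peak gain, so coefficients from different sub-blocks can be compared against a common cutoff.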






This idea was recently extended to Group Normalization (GN) of wavelet packets for
signal processing [37, 38, 40, 41] and was shown to lead to optimal performance.

Perceptual Lossless Normalization

Because an image can be regarded as a signal source based on the human visual
system, using a just-noticeable distortion profile we can efficiently remove the visual
redundancy from decomposition coefficients and normalize them with respect to the
standard of perceptual importance. A mathematical model for perception efficiency,
based on the amplitude nonlinearity in different frequency bands, has been presented by
Watson, et al. [49], which can be used to construct the "perceptual lossless" response
magnitude $Y_{j,m,v}$ for normalizing visual response in different luminance/chrominance
spaces. We extend the definition to luminance/chrominance mode according to

$Y_{j,m,v} = a_v \, 10^{\,k_v \left( \log \frac{2^{j} f_{0,v} \, d_{m,v}}{R_v} \right)^2}$    (9)

where $a_v$ defines the minimum threshold, $k_v$ is a constant, $R_v$ is the Display Visual
Resolution (DVR), $f_{0,v}$ is the spatial frequency, and $d_{m,v}$ is the directional response factor,
in each luminance/chrominance channel v.

The parameters $d_{LL,v}$ and $d_{HH,v}$ represent the thresholds for orientations LL and HH
as frequency shifts relative to the threshold for orientations LH and HL. From the nature of
dyadic wavelets, the orientation LL possesses a spectrum that is approximately a factor
of two lower in spatial frequency than orientation LH or HL. This would suggest a factor
of $d_{LL,v} = 2$. However, in a vision system response mechanism, at orientation LL the signal
energy is spread over all orientations, which implies less visual efficiency than when the
energy is concentrated over a narrow range, as is the case for the spectra of orientation
HL or LH. Thus, the threshold should be increased, which can be realized by a slight
reduction in $d_{LL,v}$.

The final value for $d_{LL,v}$ is therefore less than 2.

For orientation HH, similar effects exist. First, the Cartesian splitting of the
spectrum makes the spatial frequency of orientation HH about $\sqrt{2}$ higher than that of the
orientations HL and LH. Thus $d_{HH,v}$ can be taken as $d_{HH,v} = 2^{-1/2}$. As we mentioned before,
the spectrum along orientation HH is spread over two orthogonal orientations (45° and
135°), which should result in a log threshold increase of about $2^{1/4}$ (a shift of $2^{-1/4}$), or a
total prediction of $d_{HH,v} = 2^{-3/4} = 0.59$. Finally, the well-known oblique effect [49] will cause
a final small amount of threshold elevation.
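
To make the roles of the parameters concrete, the following sketch evaluates a threshold model of the Watson type, assumed here to have the form Y = a * 10^(k * (log10(2^j * f0 * d / R))^2). The functional form is an assumption based on the parameter list above, the function name is hypothetical, and the numerical values of a, k and f0 are placeholders chosen for illustration, not values taken from the source.

```python
import math

def perceptual_threshold(level, dvr, a, k, f0, d):
    """Assumed model: Y = a * 10**(k * log10(2**level * f0 * d / dvr)**2)."""
    x = math.log10((2.0 ** level) * f0 * d / dvr)
    return a * 10.0 ** (k * x * x)

# Directional factor for orientation HH predicted in the text: 2**(-3/4)
d_hh = 2.0 ** (-0.75)

# Placeholder parameters (illustrative only, not from the source).
a, k, f0 = 0.5, 0.4, 0.5

# The threshold equals its minimum a when 2**level * f0 * d == dvr ...
y_min = perceptual_threshold(level=2, dvr=2.0, a=a, k=k, f0=f0, d=1.0)
# ... and rises on either side of that level.
y_fine = perceptual_threshold(level=1, dvr=2.0, a=a, k=k, f0=f0, d=1.0)
y_coarse = perceptual_threshold(level=3, dvr=2.0, a=a, k=k, f0=f0, d=1.0)
```

The model is a parabola in log frequency, so the threshold is lowest at one level and grows symmetrically (in log units) toward finer and coarser levels; the directional factor simply shifts the location of that minimum.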

The chromatic channels Cb and Cr are usually each down-sampled by a factor of
two in both horizontal and vertical directions, because the sensitivity of human vision to
chromatic variation is weaker than it is to luminance variation. In the calculation of the
lossless quantization matrix, we should in this case make some correction. For example,
if the display visual resolution is known and the chroma is down-sampled by two in
each dimension, the corrected value will be half of that. This corrected value is used to
adjust the perceptual lossless normalization in the Cr and Cb channels.

Visual Sensitivity Normalization 

Visual sensitivity in each luminance/chrominance channel v is defined as the
inverse of the contrast required to produce a threshold response [9],

$S_v = 1/C_v$    (10)

where $C_v$ is generally referred to simply as the threshold. The Michelson definition of
contrast,

$C_v = (L_{\max,v} - L_{\mathrm{mean},v}) / L_{\mathrm{mean},v}$    (11)

is used, where $L_{\max,v}$ and $L_{\mathrm{mean},v}$ refer to the maximum and mean luminances of the
waveform in luminance/chrominance channel v. Sensitivity can be thought of as a
gain, although various nonlinearities of the visual system require caution in the use of
this analogy. The variations in sensitivity as a function of light level are primarily due
to the light-adaptive properties of the retina and are referred to as the amplitude
nonlinearity of the HVS. The variations as a function of spatial frequency are due to
the optics of the eye combined with the neural circuitry; these combined effects are
referred to as the contrast sensitivity function (CSF). Finally, the variations in
sensitivity as a function of signal content, referred to as masking, are due to the post-
receptoral neural circuitry.
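
The two definitions above can be illustrated numerically. The sketch follows the contrast definition given in the text, (L_max - L_mean) / L_mean, and takes sensitivity as its reciprocal; the function names and the sample waveform are hypothetical.

```python
import numpy as np

def contrast(waveform):
    """Contrast as defined in the text: (L_max - L_mean) / L_mean."""
    waveform = np.asarray(waveform, dtype=float)
    mean = waveform.mean()
    if mean == 0.0:
        raise ValueError("mean luminance must be nonzero")
    return float((waveform.max() - mean) / mean)

def sensitivity(threshold_contrast):
    """Visual sensitivity is the inverse of the threshold contrast."""
    return 1.0 / threshold_contrast

# A square-wave luminance pattern around a mean level of 100
pattern = [90.0, 110.0, 90.0, 110.0]
c = contrast(pattern)   # (110 - 100) / 100
s = sensitivity(c)
```

A pattern that is just visible at a contrast of 0.1 therefore corresponds to a sensitivity of 10; lower visible contrasts mean higher sensitivity.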

Combining the perceptual lossless normalization, the visual sensitivity
normalization and the magnitude normalization factor $\lambda_{j,m}$, we obtain the perceptual
lossless quantization matrix $Q_{j,m,v}$.

This treatment provides a simple human-vision-based threshold technique for the
restoration of the most important perceptual information in an image. We call the
combination of the above-mentioned three normalizations Color Visual Group
Normalization (CVGN) of wavelet transform coefficients.

RESULTS OF EXAMPLE APPLICATIONS

Our study suggests that the analyzing functions presented in this paper can
improve the visualization of features of importance to color images. As discussed earlier,
the color channel application of the Color Visual Group Normalization (CVGN) technique
is designed to improve the performance of our interpolating Lagrange wavelet transform
[41] in image processing. Utilizing Visual Group Normalization, the raw magnitudes of
the transform coefficients in different luminance and chrominance channels can be
normalized to represent exactly the visual perceptual strength of signal components in
each subband. Moreover, the efficient non-linear filtering method, the softer logic masking
(SLM) technique [38], provides robust edge-preservation for image restoration, and
removes the haziness encountered with the commonly used hard-logic filtering
techniques.

In our study, the original benchmark color photo of Lena was cropped to a matrix
size of 512x512. Another color picture was digitized by a Kodak digital camera with matrix
size 512x768. The original images possess clear edges, strong contrast and brightness.
Figure 4(a) is the typical noisy Lena image degraded by adding Gaussian random noise.
A simple low-pass filter smooths out the noise but also degrades the image resolution,
while a simple high-pass filter can enhance the texture edges but will also cause
additional distortion. We choose 2D half-band Lagrange wavelets as the multiresolution
analysis tools for image processing.

The median filtering (with a 3x3 window) result of Lena is shown in Figure 4(b),
which is edge-blurred with low visual quality. The speckled noise has been changed into
bumps. It is evident that our Color Visual Group Normalization technique yields better
contrast and edge-preservation results and provides a more natural color description of
this image (Figure 4(c)).

For a second example, the noisy picture, the result of median filtering and our result
are shown in Figures 5(a), 5(b) and 5(c), respectively. Our previous conclusions still hold,
showing that the CVGN technique is also suitable for image processing of pictures
having a different height-width ratio.

CONCLUSIONS 

Blind image restoration and de-noising are very difficult tasks. Because of the
complex space-frequency distribution and statistical features of images, there is almost no
definite feature discrimination between image and noise background. Even if prior
knowledge of the noise is available, a perfect extraction is theoretically impossible. The
key problem in signal processing is how to restore images "naturally" from a noisy
background or from an encoded quantization space. For a de-noising scheme based on
statistical properties, an adaptive signal-dependent transform is needed to generate a
concentrated representation of signal components. This is based on the assumption that
noise or clutter components in the signal subspace are relatively scattered with lower
magnitudes.

The logarithmic frequency band distribution of our wavelet decomposition is
well matched with the characteristics of human vision responses to obtain a compact signal
representation. Both multiorientation and multiresolution are known features of the
human visual system. There exist cortical neurons that respond specifically to stimuli
within certain orientations and frequencies. The 2-D wavelet decomposition mimics the
cortex filtering of the HVS. Visual group normalization is proposed to normalize the
frequency response so that the wavelet coefficients can exactly represent the perceptual
strength of signal components. The color channel based VGN technique results in a
thresholding method that is efficient for visual feature extraction.

We also employ a modified version of a well-known threshold method (termed
the Perceptual Softer Logic Masking (PSLM) technique [38, 40]) for image restoration,
which deals with extremely noisy backgrounds. This technique better preserves the
important visual edges and contrast transition portions of an image than does the
traditional method, and it is readily adaptable to human vision. The smooth transition
around the cut-off region of the filter can efficiently remove the Gibbs oscillation in the
signal reconstruction, and decrease the appearance of ringing and aliasing. A symmetric
extension decomposition [6] is utilized to remove non-continuous boundary effects. The
interpolation algorithm made possible by use of an interpolating wavelet transform [12,
14, 15, 16, 25, 35, 36, 41] enables us to utilize a parallel computational strategy that is
convenient for real-time implementation.

To some extent, color visual group normalization provides a measurement for
handling raw filter coefficients in a wavelet transform. The hierarchical filtering of
Softer-logic Masking can protect more signal components than a simple single layer
nonlinear method. The concept of Visual Lossless Quantization (VLQ) presented in [49]
can lead to a potential breakthrough compared to the traditional Shannon rate-distortion
theory in information processing. We have compared our approach with the commonly
used median filtering method for de-noising. The results show that the proposed method
is robust and provides the best quality for color image filtering of which we are aware.

MAMMOGRAM ENHANCEMENT USING GENERALIZED SINC WAVELETS
INTRODUCTION 

Interpolating Distributed Approximating Functionals (DAFs), designed as a set
of envelope-modulated interpolants, are generated by Gaussian-modulated Sinc, Hermite,
or Lagrange functionals [15, 31, 40, 41]. Such DAFs are smooth and decaying in both
time and frequency representations and have been used for numerically solving various
linear and nonlinear partial differential equations with extremely high accuracy and
computational efficiency. Examples include DAF-simulations of 3-D reactive quantum
scattering, the Kuramoto-Sivashinsky equations describing flow pattern dynamics for a
circular domain, the sine-Gordon equation near homoclinic orbits, and a 2-D Navier-
Stokes equation with non-periodic boundary conditions. Because the interpolating core
shell of the fundamental DAF is the interpolating Sinc functional (the ideal low-pass
filter), it is naturally useful to construct interpolating wavelets from these DAFs for use
in signal processing.

The theory of interpolating wavelets has attracted much attention recently [1, 8,
10, 11, 12, 19, 20, 26, 27, 30, 31, and 32]. It possesses the attractive characteristic that
the wavelet coefficients are obtained from linear combinations of discrete samples rather
than from traditional inner product integrals. Mathematically, various interpolating
wavelets can be formulated in a biorthogonal setting and can be regarded as an extension
of the auto-correlation shell wavelet analysis [26] and halfband filters [1]. Harten has
described a kind of piecewise polynomial method for biorthogonal interpolating wavelet
construction [12]. Sweldens independently developed this method as the well-known
interpolating "lifting scheme" theory [32], which can be regarded as a special case of the
Neville filters [19]. Unlike the previous method for constructing biorthogonal
wavelets, which attempts to explicitly solve the relevant coupled algebraic equations
[10], the lifting scheme enables one to construct a custom-designed biorthogonal wavelet
transform from just a single low-pass filter (such as a DAF filter or scaling function)
without iterations.

Generally speaking, the lifting-interpolating wavelet theory is closely related to
the finite element technique in the numerical solution of partial differential equations, the
subdivision scheme for interpolation and approximation, multi-grid generation and
surface fitting techniques. The most attractive feature of the approach is that discrete
samplings are made identical to wavelet multiresolution analysis. Without any
pre-conditioning or post-conditioning processes for accurate wavelet analysis, the
interpolating wavelet coefficients can be implemented using a parallel computational
scheme.
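As a minimal sketch of how interpolating wavelet coefficients arise from linear combinations of samples, the following example keeps the even samples and stores only the prediction error at the odd samples. It assumes the simplest two-point (linear) predictor rather than a DAF filter, and the function names `interp_wavelet_step` and `interp_wavelet_inverse` are hypothetical.

```python
def interp_wavelet_step(samples):
    """One level of an interpolating wavelet transform (illustrative only).

    Even samples are kept as the coarse approximation. Each odd sample is
    predicted by the average of its two even neighbours (an assumed
    two-point predictor, not a DAF filter); the prediction error is the
    detail (wavelet) coefficient.
    """
    if len(samples) % 2 == 0 or len(samples) < 3:
        raise ValueError("expects an odd number (>= 3) of samples")
    coarse = samples[0::2]
    detail = [
        samples[2 * k + 1] - 0.5 * (samples[2 * k] + samples[2 * k + 2])
        for k in range(len(samples) // 2)
    ]
    return coarse, detail


def interp_wavelet_inverse(coarse, detail):
    """Undo interp_wavelet_step by re-adding the predictions."""
    samples = []
    for k, d in enumerate(detail):
        samples.append(coarse[k])
        samples.append(d + 0.5 * (coarse[k] + coarse[k + 1]))
    samples.append(coarse[-1])
    return samples
```

Each detail coefficient depends only on a few neighbouring samples, so the loop iterations are independent of one another, which is what makes a parallel implementation straightforward.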

In this paper, the Sinc-DAF is employed to construct new biorthogonal
interpolating wavelets (DAF wavelets) and associated DAF-filters specifically for use
in mammogram decomposition. Two different kinds of biorthogonal interpolating DAF
wavelets (B-spline-enveloped Sinc wavelets and Gaussian-enveloped DAF wavelets) are
discussed thoroughly as examples of the generalized interpolating Sinc wavelets.
Because the finite length cutoff implementation of the Sinc (ideal low-pass)
filter causes Gibbs oscillations, the key idea for the DAF wavelet construction is to
introduce bell-shaped weighted windows to improve the characteristics of localization
in the time-frequency plane, and to ensure perfectly smooth and rapid decay.

To obtain excellent reconstruction quality of a digital mammogram, human visual
sensitivity is utilized to construct the visual group normalization (VGN) technique, which
is used to re-scale the wavelet decomposition coefficients for perceptually adapted
reconstruction according to human perception. Softer Logic Masking (SLM) is an
adjusted de-noising technique [29], derived to improve the filtering performance of
Donoho's Soft Threshold method [9]. The SLM technique more efficiently preserves
important information and edge transitions in a manner particularly suited to human visual
perception. A nonlinear contrast stretch and enhancement functional is easily realized for
wavelet multiscale gradient transformation and feature-sensitive image reconstruction,
which enables us to obtain accurate space-localization of the important features of the
mammogram. The combined techniques can greatly improve the visualization of low-contrast
components of a mammogram, which is important for diagnosis. Additionally,
interpolating DAF-wavelet image processing can be implemented as an interpolation
method. The coefficient calculations therefore only involve simple add/multiply
operations. This is extremely efficient for fast implementation.

As an example application of Sinc-DAF wavelets, we select mammogram image
processing, de-noising and enhancement, because of its huge data size, complicated
space-frequency distribution and complex perception-dependent characteristics.
Perceptual signal processing has the potential of overcoming the limits of the traditional
Shannon Rate-Distortion (R-D) theory for perception-dependent information, such as
images and acoustic signals. Previously, Ramchandran, Vetterli, Xiong, Herley, Asai, and
Orchard have utilized a rate-distortion compromise for image compression
implementation [14, 23, 24, and 42]. Our recently derived Visual Group Normalization
(VGN) technique [31] can likely be used with the rate-distortion compromise to generate
a so-called Visual Rate-Distortion (VR-D) theory to improve image processing further.



GENERALIZED SINC WAVELETS 
Sinc Wavelets

The $\pi$-band-limited Sinc function,

$$\varphi(x) = \frac{\sin(\pi x)}{\pi x} \in C^{\infty} \qquad (1)$$

in Paley-Wiener space, constructs the interpolating functionals. Every band-limited
function $f \in L^2(R)$ can be reconstructed by the equation

$$f(x) = \sum_{k} f(k)\,\frac{\sin \pi(x-k)}{\pi(x-k)} \qquad (2)$$

where the related wavelet function, the Sinclet, is defined as (see Figure 1)

$$\psi(x) = \frac{\sin \pi(2x-1) - \sin \pi(x-1/2)}{\pi(x-1/2)} \qquad (3)$$

The scaling Sinc function is the well-known ideal low-pass filter, which possesses
the filter response

$$H(\omega) = \begin{cases} 1, & |\omega| \le \pi/2 \\ 0, & \pi/2 < |\omega| \le \pi \end{cases} \qquad (4)$$

Its impulse response can be described as

$$h[k] = \frac{\sin(\pi k/2)}{\pi k} \qquad (5)$$

The so-called half-band filter only possesses non-zero impulses at odd integer
samples, $h[2k+1]$, while at even integers, $h[2k]=0$, except for $k=0$.
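The half-band property can be checked directly from the impulse response $h[k] = \sin(\pi k/2)/(\pi k)$. The short sketch below evaluates a few taps; the function name `halfband_sinc_tap` is a hypothetical helper, and the value at $k=0$ is taken as the limit 1/2.

```python
import math


def halfband_sinc_tap(k):
    """Impulse response h[k] = sin(pi*k/2) / (pi*k) of the ideal half-band filter."""
    if k == 0:
        return 0.5  # limit of sin(pi*k/2)/(pi*k) as k -> 0
    return math.sin(math.pi * k / 2.0) / (math.pi * k)


# Taps for k = -6..6: even-indexed taps (other than k = 0) vanish up to
# floating-point rounding, while odd-indexed taps alternate in sign.
taps = {k: halfband_sinc_tap(k) for k in range(-6, 7)}
```

The odd taps decay only like 1/k, which is why the filter cannot be truncated to a short FIR filter without consequences.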

However, this ideal low-pass filter cannot be implemented physically. Because
the digital filter is an IIR (infinite impulse response) solution, its digital cutoff FIR (finite
impulse response) implementation will introduce the Gibbs phenomenon (overshoot effect)
in Fourier space, which degrades the frequency resolution.

The explicit compactly supported Sinc scaling function and wavelet, as well as
their biorthogonal dual scaling function and wavelet, are shown in Figure 3. We find that
the cutoff Sinc has decreased regularity, which is manifested by a fractal-like behavior,
which implies poor time localization.

B-Spline Sinc Wavelets

Because the ideal low-pass Sinc wavelet cannot be implemented "ideally" by FIR
(finite impulse response) filters, a windowed weighting technique is introduced here to
eliminate the cutoff singularity, and improve the time-frequency localization of the Sinc
wavelets.

First, we define a symmetric Sinc interpolating functional shell as

$$P(x) = \frac{\sin(\pi x/2)}{\pi x} \qquad (6)$$

Multiplying by a smooth window, which vanishes gradually at the exact zeros of
the Sinc functional, will lead to more regular interpolating wavelets and equivalent
subband filters (as shown in Figures 4 and 5).

For example, we select a well-defined B-spline function as the weight window.
Then the scaling function (mother wavelet) can be defined as an interpolating B-spline
Sinc functional (BSF)

$$\varphi_M(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)}\,P(x) = \frac{\beta^N(x/\eta)}{\beta^N(0)}\,\frac{\sin(\pi x/2)}{\pi x} \qquad (7)$$

where N is the B-spline order and $\eta$ is the scaling factor that controls the window width. To
ensure coincidence of the zeroes of the B-spline and the Sinc shell, we set

$$2M+1 = \eta\,(N+1)/2 \qquad (8)$$
To ensure the interpolation condition, $h(2k)=0$, $k \ne 0$, it is easy to show that when the
B-spline order $N=4k+1$, $\eta$ can be any odd integer ($2k+1$). If N is an even integer, then $\eta$ can
only be 2. When $N=4k-1$, we cannot construct the interpolating shell using the definition
above. The admission condition can be expressed as

$$\begin{cases} \eta = 2, & N = 2i \\ \eta = 2k+1, & N = 4i+1 \end{cases} \qquad (9)$$



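From the interpolation relation

$$\varphi(k) = \begin{cases} 1, & k = 0 \\ 0, & k \ne 0 \end{cases}, \quad k \in Z \qquad (10)$$

and the self-induced two-scale relation

$$\varphi(x) = \sum_{k} \varphi(k/2)\,\varphi(2x-k) \qquad (11)$$

it is easy to verify that

$$h(k) = \varphi(k/2)/2, \quad k = -2M+1, \ldots, 2M-1 \qquad (12)$$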
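The relation between the scaling function and the low-pass taps, $h(k) = \varphi(k/2)/2$, can be illustrated with a small sketch. It assumes the plain Sinc of Eq. (1) as the scaling function (not a windowed DAF), and the names `sinc_scaling` and `lowpass_taps` are hypothetical.

```python
import math


def sinc_scaling(x):
    """Sinc scaling function phi(x) = sin(pi*x)/(pi*x) of Eq. (1)."""
    if x == 0:
        return 1.0
    return math.sin(math.pi * x) / (math.pi * x)


def lowpass_taps(phi, M):
    """Taps h(k) = phi(k/2)/2 for k = -2M+1, ..., 2M-1, as in Eq. (12)."""
    return {k: phi(k / 2.0) / 2.0 for k in range(-2 * M + 1, 2 * M)}


# For the Sinc scaling function the taps reduce to the half-band response
# sin(pi*k/2)/(pi*k): zero at even k (except k = 0), non-zero at odd k.
h = lowpass_taps(sinc_scaling, M=3)
```

Because $\varphi$ vanishes at the non-zero integers, every even-indexed tap other than $h(0)$ is zero, which is the interpolation condition used in the text.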



GAUSSIAN-SINC DAF WAVELETS 

We can also select a class of distributed approximating functionals, e.g., the
Gaussian-Sinc DAF (GSDAF), as our basic scaling function to construct interpolating
scalings,

$$\varphi_M(x) = W_\sigma(x)\,P(x) = W_\sigma(x)\,\frac{\sin(\pi x/2)}{\pi x} \qquad (13)$$

where $W_\sigma(x)$ is a window function which is selected as a Gaussian,

$$W_\sigma(x) = e^{-x^2/(2\sigma^2)} \qquad (14)$$

Because it satisfies the minimum frame bound condition in quantum physics, it will
improve the time-frequency resolution of the windowed-Sinc wavelet. Here $\sigma$ is a
window width parameter, and $P(x)$ is the Sinc interpolation kernel. The DAF scaling
function has been successfully introduced as an efficient and powerful grid method for
quantum dynamical propagations [40]. The Gaussian window in our DAF-wavelets
efficiently smooths out the Gibbs oscillations, which plague most conventional wavelet

bases. The following equation shows the connection between the B-spline function and
the Gaussian window [34]:

$$\beta^N(x) \cong \sqrt{\frac{6}{\pi(N+1)}}\,\exp\!\left(\frac{-6x^2}{N+1}\right) \qquad (15)$$

If we choose the window width

$$\sigma = \eta\sqrt{(N+1)/12} \qquad (16)$$

the Gaussian Sinc wavelets generated by the lifting scheme are similar to the B-spline Sinc
wavelets. Usually, the Gaussian Sinc DAF displays slightly better smoothness and more rapid
decay than the B-spline Lagrange wavelets.

If we select more sophisticated window shapes, the Sinc wavelets can be
generalized further. We shall call these extensions Bell-windowed Sinc wavelets. The
available choices include the different kinds of popularly used DFT (discrete Fourier
transform) windows, such as the Bartlett, Hanning, Hamming, Blackman, Chebyshev, and
Bessel windows.
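The effect of a bell-shaped window on the Gibbs overshoot can be illustrated numerically. The sketch below compares a bluntly truncated half-band Sinc filter with Gaussian-windowed and Hann-windowed versions; the filter half-length `K = 15`, the Gaussian width `sigma = 4.0`, and all function names are assumptions chosen for illustration, not values from the text.

```python
import math


def sinc_tap(k):
    """Ideal half-band tap sin(pi*k/2)/(pi*k), with the k = 0 limit 1/2."""
    return 0.5 if k == 0 else math.sin(math.pi * k / 2.0) / (math.pi * k)


def gaussian_window(k, sigma):
    """Gaussian window exp(-k^2 / (2 sigma^2))."""
    return math.exp(-k * k / (2.0 * sigma * sigma))


def hann_window(k, K):
    """Hann (Hanning) window on [-K, K], falling smoothly to zero at the ends."""
    return 0.5 * (1.0 + math.cos(math.pi * k / K))


def windowed_taps(K, window):
    """FIR taps h[k] * w(k) for k = -K..K; window(0) should equal 1."""
    return {k: sinc_tap(k) * window(k) for k in range(-K, K + 1)}


def peak_response(taps, n_grid=2000):
    """Largest value of the (real, symmetric) frequency response on [0, pi]."""
    peak = -float("inf")
    for i in range(n_grid + 1):
        w = math.pi * i / n_grid
        peak = max(peak, sum(h * math.cos(w * k) for k, h in taps.items()))
    return peak


K = 15
truncated = windowed_taps(K, lambda k: 1.0)                      # blunt cutoff
gaussian = windowed_taps(K, lambda k: gaussian_window(k, 4.0))   # Gaussian-Sinc
hann = windowed_taps(K, lambda k: hann_window(k, K))             # Hann-Sinc
```

Comparing `peak_response(truncated)` with the windowed versions shows the passband overshoot of the blunt cutoff and its suppression by the smooth windows, while the zeros at even `k` (the interpolation property) are unchanged because the window only rescales each tap.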



VISUAL GROUP NORMALIZATION 

Filterbank Magnitude Normalization 

On each decomposition level, 2-D wavelet coefficients are divided into four sub-blocks,
LL, HL, LH, and HH. As usual, L and H represent the low-pass and high-pass
subband filtering results, respectively. For example, HL means the signal passes the
horizontal high-pass filter first and then passes the vertical low-pass filter. Obviously, the
detailed sub-blocks HL, LH and HH represent the multiscale difference operation in three
different decomposition orientations, horizontal, vertical, and diagonal, respectively. At
each level of analysis, the bandwidth of the equivalent subband filters decreases by a
factor of two.
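The four sub-blocks can be illustrated with a one-level separable decomposition. The sketch below uses the unnormalised Haar averaging/differencing pair purely as a stand-in for the DAF filters (an assumption made to keep the example short); `haar_split` and `decompose_2d` are hypothetical names, and the image dimensions are assumed even.

```python
def haar_split(row):
    """Split an even-length sequence into low-pass (average) and
    high-pass (difference) halves using the unnormalised Haar pair."""
    low = [(row[i] + row[i + 1]) / 2.0 for i in range(0, len(row), 2)]
    high = [(row[i] - row[i + 1]) / 2.0 for i in range(0, len(row), 2)]
    return low, high


def transpose(matrix):
    return [list(col) for col in zip(*matrix)]


def decompose_2d(image):
    """One decomposition level: returns the sub-blocks LL, HL, LH, HH.

    Following the naming in the text, the first letter is the horizontal
    (along-row) filter and the second letter is the vertical filter.
    """
    h_low, h_high = [], []
    for row in image:                      # horizontal filtering
        low, high = haar_split(row)
        h_low.append(low)
        h_high.append(high)

    def vertical(block):                   # vertical filtering of each column
        lows, highs = [], []
        for col in transpose(block):
            low, high = haar_split(col)
            lows.append(low)
            highs.append(high)
        return transpose(lows), transpose(highs)

    LL, LH = vertical(h_low)    # horizontal low, then vertical low / high
    HL, HH = vertical(h_high)   # horizontal high, then vertical low / high
    return LL, HL, LH, HH
```

Applying `decompose_2d` again to the returned `LL` block gives the next level, whose equivalent filters have half the bandwidth.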

Wavelet coefficients can be regarded as the output of the signal passing through
equivalent decomposition filters (EDF). The responses of the EDF are the combination
of several recurrent subband filters at different stages [19, 28, 34, 37, 38, 40, 41]. The
EDF amplitudes of various sub-blocks differ greatly. Thus, the magnitude of the
decomposition coefficients in each of the sub-blocks cannot exactly reproduce the actual
strength of the signal components. To adjust the magnitude of the response in each block,
the decomposition coefficients are re-scaled with respect to a common magnitude
standard. Thus EDF coefficients, $C_{j,m}(k)$, in block $(j,m)$ should be multiplied by a
magnitude scaling factor, $\lambda_{j,m}$, to obtain an adjusted magnitude representation. Here j
represents the decomposition layer, and m denotes the different orientation block (LL,
LH, HL or HH).

We choose the normalizing factor to be the reciprocal of the maximum magnitude
of the frequency response of the equivalent wavelet decomposition filter on node $(j,m)$,

$$\lambda_{j,m} = \frac{1}{\sup_{\omega \in \Omega} |LC_{j,m}(\omega)|}, \quad \Omega = [0, 2\pi] \qquad (17)$$

where $LC_{j,m}(\omega)$ denotes that frequency response. Thus, the magnitude normalized
coefficients, $NC_{j,m}(k)$, are defined as

$$NC_{j,m}(k) = \lambda_{j,m}\,C_{j,m}(k) \qquad (18)$$

This idea was recently extended to Group Normalization (GN) of wavelet packets for
signal processing [37, 38, 40, 41] and was shown to yield the optimal performance.
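A minimal sketch of the magnitude normalization step is given below. It estimates the maximum magnitude of a filter's frequency response on a uniform grid and rescales a block of coefficients by its reciprocal. The equivalent filter is passed in as a plain list of taps, and the names `max_response_magnitude` and `magnitude_normalize` are hypothetical; how the equivalent filter of a given node is assembled is not shown here.

```python
import cmath
import math


def max_response_magnitude(taps, n_grid=1024):
    """Maximum of |sum_k taps[k] * exp(-i*w*k)| sampled on [0, 2*pi)."""
    peak = 0.0
    for i in range(n_grid):
        w = 2.0 * math.pi * i / n_grid
        value = sum(h * cmath.exp(-1j * w * k) for k, h in enumerate(taps))
        peak = max(peak, abs(value))
    return peak


def magnitude_normalize(coeffs, equivalent_filter):
    """Rescale sub-block coefficients by lambda = 1 / max|frequency response|."""
    lam = 1.0 / max_response_magnitude(equivalent_filter)
    return [lam * c for c in coeffs], lam
```

For example, a two-tap summing filter `[1.0, 1.0]` has peak gain 2, so its coefficients are halved, whereas an averaging filter `[0.5, 0.5]` has peak gain 1 and its coefficients are left unchanged.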

Filterbank magnitude normalization (FMN) unifies the coefficient strength in
each of the subblocks. However, an image is a perceptual signal source. Coefficients with
equal magnitude (after FMN) in different frequency channels result in greatly different
visual gain (sensitivity) for human eyes. Additional adjustments of the wavelet
coefficients are required for visual image processing.

Perceptual Lossless Normalization 

The reconstruction visibility of wavelet transform coefficients will depend upon
the display visual resolution [39] in pixel/degree. Given a viewing distance V in inches
and a display resolution d in pixel/inch, the effective display visual resolution (DVR) R
in pixel/degree of visual angle is

$$R = dV\tan(\pi/180) \approx dV/57.3 \qquad (19)$$

The visual resolution is the viewing distance in pixels (dV) divided by 57.3.

As mentioned above, when the decomposition layer increases, the bandwidth of
the equivalent subband filters decreases by a factor of two. Thus the frequency resolution
doubles. Correspondingly, the space (time) resolution (display resolution) decreases by
a factor of two. A sub-block of wavelet coefficients corresponds to a spatial frequency
band. For a display resolution of R pixel/degree, the spatial frequency f of level j is

$$f = 2^{-j} R \qquad (20)$$
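A short numerical sketch of the display visual resolution and of the per-level spatial frequency follows. It assumes Eq. (20) reads $f = 2^{-j}R$; the display parameters (100 pixel/inch viewed at 20 inches) and the function names are hypothetical illustration values.

```python
import math


def display_visual_resolution(d_pixels_per_inch, v_inches):
    """R = d * V * tan(pi/180): pixels subtended by one degree of visual angle."""
    return d_pixels_per_inch * v_inches * math.tan(math.pi / 180.0)


def level_spatial_frequency(r_pixels_per_degree, level):
    """f = 2**(-j) * R for decomposition level j (assumed reading of Eq. (20))."""
    return r_pixels_per_degree * 2.0 ** (-level)


# Hypothetical 100 pixel/inch display viewed from 20 inches.
R = display_visual_resolution(100.0, 20.0)
frequencies = [level_spatial_frequency(R, j) for j in (1, 2, 3)]
```

With these values R is close to `100 * 20 / 57.3`, and each additional decomposition level halves the spatial frequency assigned to the sub-block.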



For a Y-channel gray-scale mammogram image, the just-noticeable quantization
threshold of Y is generally different at each spatial frequency. The contrast sensitivity
declines when the spatial frequency increases (whereas the size of the stimuli decreases).
This fact is used to construct the "perceptual lossless" response magnitude for
normalization in subblock (j,m) according to the visual response.

For images based on the human vision system (HVS), using a just-noticeable
distortion profile, we can efficiently remove the visual redundancy from decomposition
coefficients and normalize them with respect to the standard of perceptual importance.
A simple mathematical model for perception efficiency, based on the amplitude
nonlinearity in different frequency bands, has been presented in [39], which can be used
to construct the "perceptual lossless" response magnitude $Y_{j,m}$ for normalizing visual
response in different luminance/chrominance spaces. We extend the definition to
luminance/chrominance modes according to



$$Y_{j,m} = a\,10^{\,k\left(\log\frac{2^{j} f_0\, d_m}{R}\right)^{2}} \qquad (21)$$

where a defines the minimum threshold, k is a constant, R is the Display Visual
Resolution (DVR), $f_0$ is the spatial frequency, and $d_m$ is the directional response factor
for each subblock.

The parameters $d_{LL}$ and $d_{HH}$ represent the thresholds for orientations LL and HH
as frequency shifts relative to the threshold for orientations LH and HL. From the nature
of dyadic wavelets, the orientation LL possesses a spectrum that is approximately a factor
of two lower in spatial frequency than orientation LH or HL. This would suggest a factor
of $d_{LL} = 2$. However, in a vision system response mechanism, at orientation LL the signal
energy is spread over all matrix elements, which implies less visual efficiency than when
the energy is concentrated over a narrow range, as is the case for the spectra of
orientation HL or LH. Thus, the threshold should be increased, which can be achieved
by a slight reduction in $f_0$. The final value for $d_{LL}$ is therefore less than 2.

For orientation HH, similar effects exist. First, the Cartesian splitting of the
spectrum makes the spatial frequency of orientation HH about $\sqrt{2}$ higher than that of
the orientations HL and LH. Thus $d_{HH}$ can be taken as $d_{HH} = 2^{-1/2}$. As we mentioned before,
the spectrum along orientation HH is spread over two orthogonal orientations (45° and
135°), which should result in a log threshold increase of about $2^{1/4}$ (a shift of $2^{-1/4}$) or a
total threshold factor of $d_{HH} = 2^{-3/4} = 0.59$. Finally, the well-known oblique effect [39] will
cause a small amount of threshold elevation.
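The arithmetic behind the quoted HH factor can be written out as a short check, assuming the two frequency shifts described above simply multiply:

```latex
d_{HH} \;=\; \underbrace{2^{-1/2}}_{\text{Cartesian splitting}} \times \underbrace{2^{-1/4}}_{\text{two-orientation spread}}
\;=\; 2^{-3/4} \;\approx\; 0.59
```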

Visual Sensitivity Normalization 

Visual sensitivity is defined as the inverse of the contrast required to produce a
threshold response [9],

$$S = 1/C \qquad (22)$$

where C is generally referred to simply as the threshold. The Michelson definition of
contrast,

$$C = (L_{max} - L_{mean})/L_{mean} \qquad (23)$$

is used, where $L_{max}$ and $L_{mean}$ refer to the maximum and mean luminances of the
waveform in a luminance channel. Sensitivity can be thought of as a gain, although
various nonlinearities of the visual system require caution in the use of this analogy. The
variations in sensitivity as a function of light level are primarily due to the light-adaptive
properties of the retina and are referred to as the amplitude nonlinearity of the HVS. The
variations as a function of spatial frequency are due to the optics of the eye combined
with the neural circuitry; these combined effects are referred to as the contrast sensitivity
function (CSF). Finally, the variations in sensitivity as a function of signal content,
referred to as masking, are due to the post-receptoral neural circuitry.
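The two definitions above amount to a pair of one-line functions. The sketch below evaluates them for hypothetical luminance values; the function names and numbers are illustrative assumptions only.

```python
def contrast(l_max, l_mean):
    """Contrast as defined in Eq. (23): (L_max - L_mean) / L_mean."""
    return (l_max - l_mean) / l_mean


def sensitivity(threshold_contrast):
    """Visual sensitivity S = 1 / C of Eq. (22)."""
    return 1.0 / threshold_contrast


# Hypothetical waveform: mean luminance 100, peak luminance 102.
c = contrast(l_max=102.0, l_mean=100.0)
s = sensitivity(c)
```

A waveform that is just visible at 2% contrast therefore corresponds to a sensitivity of about 50; a lower threshold contrast means a higher sensitivity.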

Combining the perceptual lossless normalization, the visual sensitivity
normalization and the magnitude normalization factor $\lambda_{j,m}$, we obtain the perceptual lossless
quantization matrix $Q_{j,m}$

$$Q_{j,m} = 2\,C\,Y_{j,m}\,\lambda_{j,m} \qquad (24)$$

This treatment provides a simple human-vision-based normalization technique for the
restoration of the most important perceptual information in a mammogram image. We
call the combination of the above-mentioned three normalizations the Visual Group
Normalization (VGN) of wavelet transform coefficients.

IMAGE PROCESSING TECHNIQUES 
Softer Logic Masking 
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The main objective of wavelet signal filtering is to preserve important signal 
components, and efficiently reduce noise components. 

After visual group normalization processing, coefficients have been normalized
according to human visual sensitivity. An additional filtering algorithm is required for
de-noising and removing perceptual redundancy. Hard logic masking is the commonly
used nonlinear processing technique. It is similar to a bias estimated dead-zone limiter.
Jain [14] has shown that a non-linear dead-zone limiter can improve the SNR for weak
signal detection. It can be expressed as

$$\eta_\beta(y) = \mathrm{sgn}(y)\,(|y| - \delta)_+^{\beta}, \quad -1 \le \beta \le 1 \qquad (25)$$

where $\delta$ is a threshold value, and y is the measurable value of the coefficient. Donoho
shows that the $\beta = 1$ case of the above expression is a near optimal estimator for adaptive
NMR data smoothing and de-noising [11]. Independently, we utilized the hard logic
masking to efficiently extract a target from formidable background noise in a previous
work [26, 27, 28].
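The dead-zone limiter can be sketched in a few lines, assuming it has the form $\mathrm{sgn}(y)(|y|-\delta)_+^{\beta}$, which for $\beta = 1$ is the familiar soft threshold. A keep-or-kill threshold is included for comparison; both function names are hypothetical.

```python
def dead_zone_limiter(y, delta, beta=1.0):
    """sgn(y) * (|y| - delta)_+ ** beta; beta = 1 gives the soft threshold."""
    excess = abs(y) - delta
    if excess <= 0.0:
        return 0.0          # inside the dead zone
    sign = 1.0 if y > 0 else -1.0
    return sign * excess ** beta


def hard_threshold(y, delta):
    """Keep-or-kill masking, shown for comparison."""
    return y if abs(y) > delta else 0.0
```

With `delta = 1.0`, a coefficient of 3.0 is shrunk to 2.0 by the limiter but passed unchanged by the keep-or-kill rule, while a coefficient of 0.5 is removed by both.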

Various threshold cutoffs of multi-band expansion coefficients in hard logic
masking methods are very similar to the cutoff of an FFT expansion. Thus, the Gibbs
oscillations associated with the FFT will also occur in the wavelet transform using hard logic
masking. Although hard logic masking methods with appropriate threshold values do not
seriously change the magnitude of a signal after reconstruction, they can cause
considerable edge distortions in a signal due to the interference of additional high
frequency components induced by the cutoff. The higher the threshold value, the larger
the Gibbs oscillations will be.

Edges are especially important for feature preservation and precise localization
for images and biomedical signals. We here present a Softer Logic Masking (SLM)
method. In our SLM approach, a smooth transition band near each masking threshold is
introduced so that any decomposition coefficients that are smaller than the threshold
value will be reduced gradually to zero rather than being set to zero. This treatment
efficiently suppresses Gibbs oscillations and preserves signal edges, and consequently

improves the quality of the reconstructed signal. Our SLM method can be expressed as

$$\hat{C}_{j,m}(k) = \mathrm{sgn}(C_{j,m}(k))\,(|C_{j,m}(k)| - \delta)_+^{\beta}\; SOFT(\overline{NC}_{j,m}(k)) \qquad (26)$$

where the $\hat{C}_{j,m}(k)$ are the decomposition coefficients to be retained in the reconstruction
and the quantity $\overline{NC}_{j,m}(k)$ is defined as

$$\overline{NC}_{j,m}(k) = \frac{|NC_{j,m}(k)|}{\max\{|NC_{j,m}(k)|\}} \qquad (27)$$

The softer logic mapping, $SOFT: [0,1] \rightarrow [0,1]$, is a non-linear monotonically increasing
sigmoid functional. A comparison of the hard and softer logic masking functionals is
depicted in Figure 7.
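A simplified sketch of the softer logic idea is shown below. It omits the dead-zone factor and only illustrates the gradual attenuation: each coefficient is multiplied by an S-shaped map of its normalized magnitude. The particular map used here (the cubic "smoothstep") is an assumption chosen because it satisfies SOFT(0) = 0 and SOFT(1) = 1 and is monotonic on [0, 1]; it is not the functional used by the authors, and the function names are hypothetical.

```python
def soft_map(x):
    """An assumed S-shaped map on [0, 1] with soft_map(0) = 0 and soft_map(1) = 1."""
    return x * x * (3.0 - 2.0 * x)


def softer_logic_mask(coeffs):
    """Scale each coefficient by soft_map(|c| / max|c|) instead of a hard cut."""
    peak = max(abs(c) for c in coeffs)
    if peak == 0.0:
        return list(coeffs)
    return [c * soft_map(abs(c) / peak) for c in coeffs]


# The largest coefficient passes unchanged, mid-sized ones are attenuated,
# and very small ones are driven smoothly toward zero.
masked = softer_logic_mask([8.0, -4.0, 0.4, 0.0])
```

Because the attenuation varies continuously with magnitude, there is no abrupt cutoff in the coefficient domain of the kind that produces Gibbs-like ringing after reconstruction.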

The softer logic functional can also be taken as the alternated form 



(28) 



where $\zeta$ is a normalized adaptive threshold. For an unknown noise level, an
approximation to $\zeta$ is given as



(29) 



where a is a scaling factor and can be chosen as ^=1/1.349. The quantity is an 
upper frame boundary of the wavelet packet transform, i.e., the upper boundary singular 
value of the wavelet transform decomposition matrix. Using arguments similar to those 
given by Donoho [11], one can show that the above Softer Logic Masking reconstruction
is a near optimal approximation in the min-max error sense.

Device Adapted Enhancement

The basic idea is to use gradient operators to shape flat image data so that the desired
portion of the image is projected onto a screen.

Using visual normalization and softer-logic thresholding, one can efficiently 
remove the visual redundancy and noisy components from the decomposition 
coefficients. 

For grayscale image contrast stretching, the objective is to improve the perception
capability for image components to which the human visual system is initially insensitive
but which are important for diagnosis. In other words, mammogram enhancement increases the
cancer detection probability and precision. We first appropriately normalize the
decomposition coefficients according to the length scale of the display device so that they
fall within the interval of the device frame. Assume that the image coefficients have
already been properly scaled by visual group normalization so that the amplitude value
$NC_{j,m}(k)$ falls into the dynamic range of the display device:

$$d_{min} \le NC_{j,m}(k) \le d_{max} \qquad (30)$$

Without loss of generality, we consider the normalized gradient magnitude,

$$U_{j,m}(k) = NC_{j,m}(k)/(d_{max} - d_{min}) \qquad (31)$$

Mallat and Zhong realized that wavelet multiresolution analysis provides a natural
characterization for multiscale image edges, and these can be easily extracted by various
differentiations [15]. Their idea was extended by Laine et al. [7] to develop directional
edge parameters based on a subspace energy measurement. An enhancement scheme
based on complex Daubechies wavelets was proposed by Gagnon et al. [9]. These authors
made use of the difference between the real and imaginary parts of the wavelet
coefficients. One way or another, adjusted wavelet transforms should be designed to
achieve the desired edge enhancement.

Our starting point is taken to be the magnitude normalized or visual group
normalized wavelet subband coefficients $NC_{j,m}(k)$ [10, 12]. We define an enhancement
functional $E_{j,m}$

$$E_{j,m} = \alpha_{j,m} + \beta_{j,m}\,\Delta \qquad (32)$$

where $\Delta$ is the Laplacian and $-1 \le \alpha_{j,m}, \beta_{j,m} \le 1$. The coefficients $\alpha_{j,m}$ and $\beta_{j,m}$ can be easily
chosen so that desired image features are emphasized. In particular we can emphasize an
image edge of selected grain size. We note that a slight modification of $\alpha_{j,m}$ and $\beta_{j,m}$ can
result in orientation selected image enhancement. A detailed discussion of this matter will
be presented elsewhere. An overall re-normalization is conducted after image
reconstruction to preserve the energy of the original image. We call this procedure
enhancement normalization.
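The action of an enhancement functional of the form $\alpha + \beta\Delta$ can be sketched on a 1-D row of coefficients. The discrete Laplacian is taken as the usual second difference, and the end points are replicated; both choices, the parameter values, and the function name `enhance` are assumptions for illustration.

```python
def enhance(coeffs, alpha, beta):
    """Apply E = alpha + beta * Laplacian to a 1-D list of coefficients.

    The Laplacian is the second difference c[i-1] - 2*c[i] + c[i+1]; the
    end points are replicated (an assumed boundary rule).
    """
    n = len(coeffs)
    out = []
    for i in range(n):
        left = coeffs[max(i - 1, 0)]
        right = coeffs[min(i + 1, n - 1)]
        laplacian = left - 2.0 * coeffs[i] + right
        out.append(alpha * coeffs[i] + beta * laplacian)
    return out


# A step edge: a negative beta steepens the edge by adding under- and
# overshoot on either side, while flat regions are only scaled by alpha.
step = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
sharpened = enhance(step, alpha=1.0, beta=-0.25)
```

Varying `alpha` and `beta` per sub-block (j, m) is what allows edges of a selected scale or orientation to be emphasized.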

Contrast stretching is an old but quite efficient method for feature selective image
display. Nonlinear stretching has been used by many authors [3, 7, and 16]. Lu and
coworkers [16] have recently designed a hyperbolic function

$$g_j(k) = \frac{\tanh(ak - b) + \tanh(b)}{\tanh(a - b) + \tanh(b)} \qquad (33)$$

for wavelet multiscale gradient transformation. Their method works well for lunar
images. The basic idea is to use gradient operators to shape a flat image so that the desired
portion of the image is projected into a screen window.
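The hyperbolic stretching function can be evaluated directly. The sketch below assumes the argument is a normalized gradient magnitude in [0, 1]; the parameter values `a = 8.0` and `b = 2.0` and the name `hyperbolic_stretch` are hypothetical choices for illustration.

```python
import math


def hyperbolic_stretch(k, a, b):
    """g(k) = [tanh(a*k - b) + tanh(b)] / [tanh(a - b) + tanh(b)], for k in [0, 1]."""
    return (math.tanh(a * k - b) + math.tanh(b)) / (math.tanh(a - b) + math.tanh(b))


# Hypothetical parameters: a sets the steepness, b/a the centre of the stretch.
curve = [hyperbolic_stretch(i / 10.0, a=8.0, b=2.0) for i in range(11)]
```

The normalization in the denominator fixes the end points at g(0) = 0 and g(1) = 1, so the map redistributes contrast within the display range without changing the range itself.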

ENHANCEMENT RESULT 

To test our new approaches, digital breast mammogram images are employed.
Mammograms are complex in appearance and signs of early disease are often small
and/or subtle. Digital mammogram image enhancement is particularly important for
aiding radiologists in the development of an automatic detection expert system. The
original image is coded at a size of 768x800 with a 200-micron pixel edge, as shown in Figure
8(a). As shown in Figure 8(b), there is a significant improvement in both edge
representation and image contrast resulting from DAF-wavelets combined with our
Visual Group Normalization (VGN) and non-linear enhancement techniques. In
particular, the domain and internal structure of higher-density cancer tissues are clearly
displayed. The results are characterized by high-quality image enhancement and good
signal averaging over homogeneous regions with minimal resolution degradation of
image details.

CONCLUSION 

The newly developed DAF-wavelet image processing and enhancement
techniques can improve present and future imaging performance for earlier detection of
cancer and malignant tumors. They improve the spatio-temporal resolution of biomedical
images and enhance the visualization of the perceptually less-sensitive components that
are very important for diagnosis, as well as reduce distortion and blur.

Different image processing techniques (distortion suppression, enhancement, and 
edge sharpening) can be integrated in a single step process or can be alternatively applied 
by different coefficient selection. Our biomedical image processing and computer vision 
research should prove important in meeting hospital needs in image enhancement, image 
analysis and expert diagnosis. 

The method presented can be applied to various types of medical images. These
include various X-ray images, Mammography, Magnetic Resonance Imaging (MRI),
Supersonic imaging, etc. Enhanced imaging of internal organs and/or other parts of the
human body, for the detection of cancer and other diseases, is of great importance. It has
the potential for earlier, cost-effective diagnosis and management of disease, can provide
improved visualization of normal versus diseased tissue, and enhance the study of drug
diffusion and cellular uptake in the brain.

The method dramatically improves the image quality (in terms of signal to noise
improvements and/or contrast differentiation) by de-noising and enhancement, as
demonstrated by the digital mammogram example.
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DUAL PROPAGATION INVERSION OF FOURIER AND LAPLACE SIGNALS 
INTRODUCTION 

In many physical and chemical phenomena, two functions are related by an integral
equation of the form

$$C(x) = \int K(x,x')\,f(x')\,dx'$$

where $K(x,x')$ is some sort of "Transform Kernel", and x, x' may be conjugate variables
(depending on the transform under consideration). For experimental or computational
studies, the function C(x) is typically known only on a finite, discrete set of points.
Extracting f(x') from measured or calculated values of C(x) is referred to as "inversion".
Famous examples of this inversion problem are the Fourier and Laplace transforms.
Although there are well-known, exact mathematical procedures for these inversions, one
generally faces serious difficulties in numerically determining the function f(x') from
C(x)-values known at a finite number of discrete points. In the Laplace transform, for
example, the inversion may be unstable due to amplification of noise or other errors in
the C(x) data [1]. Inverse Fourier transforms can also suffer from slow convergence of
the integration, requiring a large range of samples of C(x), so that one is able to
obtain only low resolution spectra when a sufficiently long duration time signal is not
available [2,3].

A convenient procedure for numerically inverting such an integral equation is
disclosed, with the focus here being primarily on real-time spectra. We shall demonstrate
the method using a simple Fourier transformation as an example, but the procedure may
be useful for other inversion problems. Our method, termed the "dual propagation
inversion" (DPI) method, makes use of the distributed approximating functional
(DAF) free propagator [4] to carry out the inversion, but other numerical free propagator
techniques (such as the Fast Fourier method [5,6]) may also be used. The Fourier
transform is naturally invoked whenever one tries to obtain information in the
frequency-domain from a time-domain signal. In general, for real spectra, the
time-domain and frequency-domain spectra are related by




(1)




(2) 
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and 



(3) 



Typical examples of the sorts of problems of interest are the computation of absorption
spectra or Raman spectra [7] from the autocorrelation function and the extraction of
vibrational frequencies from a molecular dynamics simulated time signal [8]. The
difficulty in the direct inversion by Eq. (3) is that in order to resolve close-lying
frequencies, one needs to know the autocorrelation function (or time signal in general)
on a very dense time grid over a long period of time [2,3]. When the time signal is
available only on a set of discrete times and for a short time period, this approach
becomes inefficient and inaccurate. Several elegant methods, such as the
filter-diagonalization technique, developed by Neuhauser [9] and modified by
Mandelshtam and Taylor [10], have been proposed for such problems.

In the present approach, we begin by inserting a factor, $\exp[i\alpha(\omega-\omega_0)^2]$, into Eq.
(2), thereby defining an auxiliary function

(4)

(Note that Eq. (4) can be viewed as a type of Fresnel transform [11] with an integral kernel
$K(\omega,\omega_0) = \exp[i\alpha(\omega-\omega_0)^2]$. From that viewpoint, the kernel acts on $e^{-i\omega t} f(\omega)$ rather than
on $f(\omega)$.) Due to the highly oscillatory nature of the factor $\exp[i\alpha(\omega-\omega_0)^2]$ for large $\alpha$
values, only frequencies sufficiently close to $\omega = \omega_0$ contribute significantly to Eq. (4).
In fact, it can be shown that in the limit $\alpha \rightarrow \infty$, $\tilde{C}(t;\alpha,\omega_0)$ becomes proportional to $f(\omega_0)$.
The factor, $\exp[i\alpha(\omega-\omega_0)^2]$, can be viewed as a window function that acts as a "filter" so
that the auxiliary function, $\tilde{C}(t;\alpha,\omega_0)$, contains information principally at frequencies near
$\omega = \omega_0$. We note that the factor $\exp[i\alpha(\omega-\omega_0)^2]$ also can be expressed as a matrix element
of the "free propagator", $\exp[(-i/4\alpha)(d^2/d\omega^2)]$, according to

(5)


and the integration over $\omega$ in Eq. (4) is equivalent to evaluating the action of
$\exp[(-i/4\alpha)(d^2/d\omega^2)]$ in the continuous "$\omega$"-representation. That is, the introduction of the
phase $\exp[i\alpha(\omega-\omega_0)^2]$ is equivalent to freely propagating (in the sense of quantum
propagation) the function $e^{-i\omega t} f(\omega)$ for a "duration", $1/(4\alpha)$, in a generalized time. This
enables us to write Eq. (4) as

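$$\tilde{C}(t;\alpha,\omega_0) = (i\pi/\alpha)^{1/2}\,\exp\!\left[-\frac{i}{4\alpha}\frac{\partial^2}{\partial\omega_0^2}\right]\left(e^{-i\omega_0 t} f(\omega_0)\right) \qquad (6)$$

The function $f(\omega_0)$ then is obtained simply by inverting Eq. (6),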

This equation indicates that the value of the function $f(\omega_0)$ can be computed by freely
propagating the auxiliary function $\tilde{C}(t;\alpha,\omega_0)$ for a "duration" $1/(4\alpha)$. However, for this
equation to be of any use, we need an independent method for calculating the auxiliary
function from the experimentally (or theoretically) determined C(t). The function
$\tilde{C}(t;\alpha,\omega_0)$ can be calculated directly from the experimentally determined signal, C(t),
using the following expression:



$$\hat{C}(t_0;\alpha,\omega_0) = e^{-i\omega_0 t_0}\, e^{\,i\alpha\,\partial^2/\partial t_0^2}\left[e^{i\omega_0 t_0}\, C(t_0)\right]. \qquad (8)$$



This equation is obtained by deriving (and integrating) the auxiliary partial differential 
equation, 

Thus, our procedure consists of propagating the experimental time signal, C(t)
(multiplied by exp(iω₀t)), from t to t₀ over the "duration" α to obtain Ĉ(t₀; α, ω₀), and then
propagating the resulting Ĉ(t₀; α, ω₀) from ω₀ to ω over the "duration" 1/(4α). Clearly,
the product of the two "propagation durations" is a constant (α × 1/(4α) = 1/4), indicating
that there exists a reciprocal relationship characteristic of a "time-energy"-like uncertainty
principle. In the numerical example we give below, we shall choose α = 1/(4α) = 1/2;
however, this is not necessary and the question of the optimum "duration" for each
propagation remains to be explored. We also find it numerically efficient to employ the
DAF-free propagator [4],




where 



$$b_n = \frac{(-1)^n}{(2\pi)^{1/2}\,\sigma(0)\,n!\,2^{2n}} \qquad (11)$$

and

$$\sigma^2(\alpha) = \sigma^2(0) + 2i\alpha, \qquad (12)$$

where σ(0) and M are DAF parameters, and the H_{2n}'s are the Hermite polynomials
[4,12]. This makes it possible to carry out the two propagations with a single
matrix-vector multiplication for each propagation. It should be noted that this DAF-free
propagator exactly propagates the DAF representation of a function.

We believe that the dual-propagation inversion scheme we have outlined has 
several potential advantages. First, the DAF-fitting method on which the scheme is based
allows us to effectively filter noise while preserving signal. The method provides a 
"signature" as a function of the DAF parameters which gives a measure of signal to noise 
and allows the parameters to be varied to achieve optimum filtering. Second, as already 
noted, the propagation of the DAF representation of a function using the Hermite DAF 
propagator is exact. Third, each propagation has associated with it a characteristic width 
controlled by the Gaussian (see Eqs. (10) and (12)) and determined by the optimum fit 
of the input data. These widths control on what domain of t the function C(t) must be
known in order to determine a particular value of f(ω₀). Finally, DAFs can be used to
"pad" (i.e., extend) the initial C(t) dataset in various ways to take advantage of any a 
priori knowledge one might have. 

A detailed comparison of our method with other inversion methods is beyond the
scope of this brief report and will be the topic of a subsequent communication.
However, we will illustrate the method on a challenging problem, namely extracting a
half-sine wave from its discretized and noisy transform. The underlying function, f(ω),
in our example is then

$$f(\omega) = \begin{cases} \sin(\omega), & 0 \le \omega \le \pi \\ 0, & \text{otherwise.} \end{cases} \qquad (13)$$

Since f(ω) has compact support, the time signal, C(t) (analytically obtained by Eq. (2)),
of course, cannot have compact support. In fact, it has a very slow decay with increasing |t|.
First, f(ω) was computed by our dual propagation using Eqs. (7) and (8), and the result
compared with the truncated sine function, Eq. (13). In Fig. 1, we present the auxiliary
function, Ĉ(t; α, ω₀), at t = 0 and α = 1/2, as a function of ω₀. We have calculated this
function both by direct numerical integration of Eq. (4) and by DAF propagation using
Eq. (8) [13]. The results of the DAF propagation and the direct numerical integration are
essentially identical up to the fourth decimal place. The propagation to generate the
auxiliary function has been carried out by a highly efficient "one-step DAF-propagation"
of duration α = 1/2. Figure 2 compares the spectral function f(ω), obtained by the present
dual propagation [13], with the original truncated sine function. It is seen that the two are
visually indistinguishable. In this calculation, we assumed discrete "experimental" values
of C(t) are known from −45 ≤ t ≤ 45 on a discrete grid with uniform spacing Δt₀ = 1/18, and
the propagation from ω₀ to ω employs the computed values of Ĉ(t₀; α, ω₀) from
−30 ≤ ω₀ ≤ 35, with a uniform grid spacing of Δω₀ = 1/18. We emphasize that one only
needs the time signal C(t) for a finite time interval, [t_min, t_max], in order to calculate
Ĉ(t₀; α, ω₀) using Eq. (8). Although the accuracy increases for longer-time signals, C(t),
the length of the time interval is controlled by the width of the DAF free propagator matrix
(determined by σ/Δt₀). Thus, short-duration time signals can yield an accurate spectrum
f(ω) for the frequencies filtered out in Eqs. (4) and (8) in our method. It should also be
noted that the present technique does not require a knowledge of the system Hamiltonian,
or any diagonalization processes, since it makes direct use of the time signal (or
autocorrelation function). Thus, the measured or computed time signals directly yield
information in the frequency domain (e.g., optical spectra or normal mode frequencies).
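
The direct numerical integration mentioned in this example can be sketched as follows. This is only an illustration under stated assumptions: the auxiliary function is taken to have the form Ĉ(t; α, ω₀) = ∫ dω exp[iα(ω − ω₀)²] e^{−iωt} f(ω), which is our reading of the description of Eq. (4), and f(ω) is the half-sine of Eq. (13). The function name `aux_direct`, the grid sizes, and the large value of α used in the final check are hypothetical choices. The final lines test the large-α behaviour noted earlier, namely that the auxiliary function becomes proportional to f(ω₀); the proportionality factor (iπ/α)^{1/2} is the standard Fresnel integral.

```python
import numpy as np

def aux_direct(t, alpha, omega0, n=20001):
    """Trapezoidal estimate of the assumed auxiliary function for the half-sine."""
    omega = np.linspace(0.0, np.pi, n)      # support of f(omega) = sin(omega)
    integrand = (np.exp(1j * alpha * (omega - omega0) ** 2)
                 * np.exp(-1j * omega * t) * np.sin(omega))
    step = omega[1] - omega[0]
    # trapezoidal rule written out explicitly
    return step * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))

# Auxiliary function at t = 0 and alpha = 1/2 on a coarse grid of omega0 values.
omega0_grid = np.arange(-3.0, 6.0, 0.5)
aux_values = np.array([aux_direct(0.0, 0.5, w0) for w0 in omega0_grid])

# Large-alpha check at omega0 = pi/2, where f(omega0) = 1 (assumed test point).
alpha_big = 200.0
ratio = aux_direct(0.0, alpha_big, np.pi / 2) / np.sqrt(1j * np.pi / alpha_big)
```

For the large value of α the quantity `ratio` should lie close to f(π/2) = 1, while the α = 1/2 values in `aux_values` are spread over a wide range of ω₀, which is why a second propagation is needed to recover f.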

One of the features of DAFs that has been discussed previously is their ability to
filter noise from a given function. The free-propagating DAF described here also has this
filtering property, which is extremely useful in the present context. When an inversion
is carried out for experimental data, the presence of noise in the time signal C(t) will
create inaccuracies in the calculated spectrum, f(ω). (This problem is especially serious
in the inverse Laplace transformation, since statistical errors present in the signal can be
easily amplified.) In order to illustrate the filtering feature of our DPI procedure, we
added random noise (up to ±20%) to the time signal C(t), and carried out the dual
propagation inversion as before. We find that the resulting auxiliary function, Ĉ(t₀; α, ω₀),
is very similar to that obtained earlier using the noise-free time signal, confirming that
high frequency, random noise is automatically removed by the DPI method. Figure 3
illustrates this denoising feature of the DPI, using Hermite-DAF free propagators, by
comparing the spectrum obtained from the "corrupted" time signal (with ±20% noise)
to that obtained from the noise-free time signal. Both time domain signals extend over
the range −45 ≤ t ≤ 45, with the same sampling frequency as before. It is seen that the DPI
almost completely removes the effects of the noise in the time signal. We also employed
a stationary DAF filtering method [14] to remove the noise from the time signal before
carrying out the DPI procedure, and obtained a frequency domain spectrum f(ω) that is
visually indistinguishable from that depicted in Fig. 3.

Finally, we explore the consequences of a more severe truncation of the
time-domain signal, and therefore carried out the DPI procedure using C(t) over the
artificially shortened interval −5 ≤ t ≤ 5. In these calculations, we have treated the noiseless
C(t). Our results are shown in Fig. 4, where we compare the inversion result to the exact,
original signal. We see that there is relatively little error induced (comparable to that due
to introducing noise; see Fig. 3). A common procedure to decrease the aliasing effects
caused by the truncation of the time domain signal is to artificially damp C(t) to zero
beyond certain times. This is typically done by interpolation or by smoothly joining an
exponentially decaying analytical "tail". In the case of DAFs, we can add a gap on either
side of the truncated C(t), and use DAF-fitting to "fill in" the missing gap values, thereby
joining the actual data to an analytical tail function. In the present study, we introduced
the gaps 5 ≤ |t| ≤ 7.5, and joined the known data between −5 ≤ t ≤ 5 to the analytical tail

C(t) = −0.040648 exp(−0.107871 …    (14)

The results of the DPI are compared to the original signal in Fig. 5, and we see that while
some aliasing still exists, it is reduced over the entire range of ω. Similar results are
obtained when noisy signals are truncated, and then DAF-joined to an analytical decaying
tail function. We conclude by noting that we expect even better results will be obtained
by using DAFs to periodically (or in other ways) extend the truncated time domain
signal. This will be reported elsewhere.

We conclude by briefly considering the Laplace transform problem, namely 
numerically inverting 

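$$C(t) = \int_0^{\infty} d\omega\, e^{-\omega t} f(\omega), \qquad (15)$$

which our method (in principle) also solves. Here C(t) is analytic in the half-plane (t > 0).
In the standard way, it is convenient to introduce a new function

$$\tilde{f}(\omega) = \begin{cases} f(\omega), & \omega > 0 \\ 0, & \omega < 0 \end{cases}$$

to write

$$C(t) = \int_{-\infty}^{\infty} d\omega\, e^{-\omega t}\, \tilde{f}(\omega). \qquad (16)$$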



Exactly paralleling our previous discussion we then obtain 

7K) = (*M lf7 ^**kdtoa,»o). (17) 

where now 

C(i;a >Wo ) = J e iQ ^o)\^tj^ (jg) 

The quantity Ĉ(t; α, ω₀) obeys the equation

(19)

subject to the initial-value boundary condition Ĉ(t; α = 0, ω₀) = C(t). Equation (19) is
identical to Eq. (9) except for the sign of the right-hand side. The inversion then proceeds
by solving Eq. (19), subject to the boundary condition at α = 0, and propagating the result
via Eq. (17) to obtain the answer. Although this solves the problem in principle, the first
step (that of solving Eq. (19) to acceptable accuracy when C(t) is imprecisely known) is
a challenge. In the Fourier transform case, we solved the corresponding partial
differential equation by means of a second propagation, and indeed we can still write the
formal solution to Eq. (19) as

(20) 

since C(t) is analytic in the half-plane (t > 0) and hence its derivatives to all orders
uniquely exist in the half-plane. However, using the DAF propagation scheme (at least
in a straightforward manner) presents difficulties near the origin because of the
singularity in C(t) at t = 0. Various stratagems can be devised for attacking these problems,
and they are currently under investigation.

DISTRIBUTED APPROXIMATING FUNCTIONAL WAVELET NETS
INTRODUCTION

For real-world signal processing, pattern recognition and system identification, 
information extraction from a noisy background is the fundamental objective. To obtain 
an ideal output vector Y(X, W) from the observation input vector X, the system (neural
network) should possess the following two kinds of smoothness [6] (where W is the
response or the entry of the system, normally for neural networks, called the weight
vector).

(a) Functional space smoothness 

(b) State space smoothness 

The degree of smoothness in functional space governs quality of the filtering of the 
observed (noisy) signal. The smoother the output signal, the more noise is suppressed. 
State space smoothness implies that a weak fluctuation of the weight vector W = {w(i),
i = 0, …, L − 1} has a small effect on the output signal, which makes the system less sensitive
to the input distortion.

For a robust estimation system, the output not only approaches the observed 
signal value, but also smoothes the signal to suppress the distortion due to noise. 
Simultaneously, the state space should be smooth to ensure stability. Based on these 
facts, one finds that the least mean square (LMS) error 



$$E_A = \int \left[Y(X) - Y(W,X)\right]^2 dX, \qquad (1)$$



the regularization constraints of order r 

$$E_r = \int \left[\frac{\partial^r Y(W,X)}{\partial X^r}\right]^2 dX, \qquad (2)$$



and the condition of the system 



^ awe w 



are the three dominant factors that must be accounted for in designing a robust estimation
system.

Distributed approximating functionals (DAFs), which can be constructed as a
window-modulated interpolating shell, were introduced previously as a powerful grid
method for numerically solving partial differential equations with extremely high
accuracy and computational efficiency [3,9,10]. In this paper, we use the DAF
approximation scheme for implementing a neural network.

Compared with other popular networks, DAF nets possess advantages in several
areas:

(1) A DAF wavelet is infinitely smooth in both time and frequency domains.

(2) For essentially arbitrary order of the Hermite polynomial, the DAF shells
possess an approximately constant shape, while commonly used wavelet functions
always become more oscillatory as the regularization order is increased.

(3) The translation invariance of the DAF approximation ensures feature 
preservation in state space. The signal processing analysis can be implemented in a space 
spanned by the DAFs directly. 

(4) Complicated mathematical operations, such as differentiation or integration, 
can be carried out conveniently using the DAF interpolating shell. 

(5) The identical smoothness of the DAF wavelet space and DAF state space 
underlie the inherent robustness of the DAF wavelet nets. 

REGULARIZED DAF WAVELET NETS 

In general, signal filtering may be regarded as a special approximation problem 
with noise suppression. According to DAF polynomial theory, a signal approximation 
in DAF space can be expressed as 

$$\hat{g}(x) = \sum_i g(x_i)\,\delta_\alpha(x - x_i), \qquad (4)$$

where δ_α(x) is a generalized symmetric delta functional. We choose it as a Gauss-
modulated interpolating shell, or the so-called distributed approximating functional (DAF)
wavelet. The Hermite-type DAF wavelet is given in the following equation [10]:

$$\delta_M(x|\sigma) = \frac{1}{\sigma}\exp\left(-\frac{x^2}{2\sigma^2}\right)\sum_{n=0}^{M/2}\left(-\frac{1}{4}\right)^n \frac{1}{\sqrt{2\pi}\,n!}\,H_{2n}\left(\frac{x}{\sqrt{2}\,\sigma}\right). \qquad (5)$$

The function H_{2n} is the Hermite polynomial of even order 2n. The qualitative behavior
of one particular Hermite DAF is shown in Fig. 1.
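
As a rough numerical illustration of the Hermite DAF, the sketch below evaluates the standard form δ_M(x|σ) = (1/σ) exp(−x²/2σ²) Σ_{n=0}^{M/2} (−1/4)^n H_{2n}(x/√2σ) / (√(2π) n!) and uses it, in the manner of Eq. (4), to approximate a smooth function from its grid samples. The function name `hermite_daf`, the parameter values (σ = 1, M = 20), the grid spacing, and the explicit factor of the grid spacing in the sums are assumptions made for this example only; they are not taken from the text.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval   # physicists' Hermite polynomials

def hermite_daf(x, sigma, M):
    """Hermite DAF delta_M(x|sigma) built from even-order terms up to H_M."""
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        # coefficient multiplying H_{2n} in the DAF sum
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    y = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    return np.exp(-y ** 2) * hermval(y, coeffs) / sigma

dx = 0.1                                        # assumed uniform grid spacing
grid = np.arange(-20.0, 20.0 + dx / 2, dx)
sigma, M = 1.0, 20

# The DAF behaves like a delta function: its grid sum is close to one.
norm = dx * hermite_daf(grid, sigma, M).sum()

# DAF approximation of sin(x) at an off-grid point from the grid samples.
x0 = 0.33
approx = dx * np.sum(hermite_daf(x0 - grid, sigma, M) * np.sin(grid))
```

With these settings `norm` should be very close to 1 and `approx` very close to sin(0.33), which reflects the delta-function-like behavior described in the text.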

The DAF wavelet neural nets possess the alternative feature of the commonly
used DAF approximation as

$$\hat{g}(x) = \sum_i w(i)\,\delta_\alpha(x - x_i). \qquad (6)$$

The weights w(i) of the nets determine the superposition approximation ĝ(x) to the
original signal g(x) ∈ L²(R). It is easy to show that the weights of the approximation nets,
w(i), are closely related to the DAF sampling coefficients g(x_i). The irregular finite
discrete time samples of the original signal are selected for network learning. If the
observed signal is limited to an interval I containing a total of N discrete samples, I = {0,
1, …, N − 1}, the square error of the signal is digitized according to

$$E_A = \sum_{n=0}^{N-1}\left[g(n) - \hat{g}(n)\right]^2. \qquad (7)$$

This cost function is commonly used for neural network training in a noise-free 
background and is referred to as the minimum mean square error (MMSE) rule. 

However, if the observed signal is corrupted by the noise, the network produced 
by MMSE training causes an unstable reconstruction, because the MMSE recovers the 
noise components as well as the signal. In this case, the signal-noise-ratio (SNR) cannot 
be improved much. Even for a noise-free signal, MMSE may lead to Gibbs-like 
undulations in the signal, which is harmful for calculating accurate derivatives. Thus, for 
more robust filtering, the network structure should be modified to deal with the particular 
situation. In this paper, we present a novel regularization design of the cost function for 
network training. It generates edge-preserved filters and reduces distortion. To define the 
regularity (smoothness) of a signal, we introduce a "Lipschitz index" [6]. 

Definition 1: Let f(x) ∈ L²(R). For any α > 0, if

$$|f(x) - f(y)| = O\left(|x - y|^{\alpha}\right), \qquad (8)$$

the signal f(x) is said to be unified Lipschitz in the space L²(R). The constant, α, is the
Lipschitz index of f(x).



WO 99/46731 



PCT/US99/05426 




It is easy to show that when the Lipschitz index, α, is an integer, the Lipschitz
regularity is equivalent to the differentiability of f(x) with the same order. For commonly
used signals, the Lipschitz index α > 0. In the presence of noise distortion, the Lipschitz
index always satisfies α < −1. This is because the noise causes sudden phase changes at
neighboring points. To eliminate non-ideal undulations, we need to preserve the signal
trends while making a small MSE approximation. To achieve this, an additional smooth
derivative term, E_r, is introduced to modify the original cost function. The new cost
function is then

$$E = \sum_k \left[g(k) - \hat{g}(k)\right]^2 + \lambda \sum_k \left[\frac{\partial^r \hat{g}(k)}{\partial x^r}\right]^2. \qquad (9)$$

The factor λ introduces a compromise between the orders of approximation and
smoothness. Generally, the derivative order r > 2 is used to evaluate the smoothness of the
signals. Using the properties of the Hermite DAFs, we find that the derivative term of the
regularized cost function E_r can be expressed in a comparatively simple convolution
form,

^I-^-x,,.) (1Q) 



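where δ_α^{(r)} is termed a "differentiating DAF" and is given by

$$\delta^{(r)}_{\alpha}(x - x_k|\sigma) = \frac{1}{\sigma}\left(\frac{-1}{\sqrt{2}\,\sigma}\right)^{r} \exp\left[-\frac{(x - x_k)^2}{2\sigma^2}\right] \sum_{n=0}^{M/2}\left(-\frac{1}{4}\right)^n \frac{1}{\sqrt{2\pi}\,n!}\,H_{2n+r}\left(\frac{x - x_k}{\sqrt{2}\,\sigma}\right). \qquad (11)$$

It is exactly the rth derivative of δ_α. Because of the infinite smoothness of DAF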
wavelets, any order derivative can be obtained. The Hermite polynomial H is generated 
by the usual recursion

$$H_n(x) = \begin{cases} 1, & n = 0 \\ 2x, & n = 1 \\ 2x\,H_{n-1}(x) - 2(n-1)\,H_{n-2}(x), & n > 1. \end{cases} \qquad (12)$$

The predominant advantage of the Hermite polynomial approximation is its high-order
derivative preservation (which leads to a smooth approximation).
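
The three-term recursion H_0 = 1, H_1 = 2x, H_n = 2x H_{n−1} − 2(n − 1) H_{n−2} translates directly into a short loop. The following is a minimal sketch; the function name `hermite` is an illustrative choice.

```python
def hermite(n, x):
    """Physicists' Hermite polynomial H_n(x) from the three-term recursion."""
    if n == 0:
        return 1.0
    h_prev, h = 1.0, 2.0 * x                 # H_0 and H_1
    for k in range(2, n + 1):
        # H_k = 2x H_{k-1} - 2(k-1) H_{k-2}
        h_prev, h = h, 2.0 * x * h - 2.0 * (k - 1) * h_prev
    return h
```

For example, `hermite(3, 2.0)` evaluates H_3(x) = 8x³ − 12x at x = 2.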

To increase the stability of the approximation system further, an additional
constraint in state space is taken to be

Em =-i 

t 

Thus the complete cost function utilized for DAF wavelet net training is given by 



K t 
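
To make the role of the three contributions concrete (data fidelity, derivative smoothness, and a state-space constraint on the weights), the sketch below minimizes a simplified cost of that general type in closed form. It is not the training procedure of this work: the weights are taken to be the output samples themselves rather than coefficients of DAF shells, the derivative is a finite difference rather than a differentiating DAF, and the penalty weights `lam` and `eta` and the function name `regularized_fit` are hypothetical.

```python
import numpy as np

def regularized_fit(g, lam, eta, r=2):
    """Minimise ||g - w||^2 + lam*||D_r w||^2 + eta*||w||^2 in closed form."""
    n = len(g)
    D = np.diff(np.eye(n), n=r, axis=0)      # r-th order finite-difference operator
    # Setting the gradient to zero gives ((1 + eta) I + lam D^T D) w = g.
    A = (1.0 + eta) * np.eye(n) + lam * (D.T @ D)
    return np.linalg.solve(A, g)

rng = np.random.default_rng(0)
x = np.linspace(0.0, 2.0 * np.pi, 200)
clean = np.sin(x)
noisy = clean + 0.2 * rng.standard_normal(x.size)
smooth = regularized_fit(noisy, lam=50.0, eta=1e-3)

rms_noisy = np.sqrt(np.mean((noisy - clean) ** 2))
rms_smooth = np.sqrt(np.mean((smooth - clean) ** 2))
```

Increasing `lam` trades fidelity for smoothness, while `eta` damps the size of the weights; in this toy setting the regularized output has a noticeably smaller RMS error than the noisy input.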

SIMULATIONS 

Two biomedical signal processing applications (for electrocardiogram and 
electromyography) using the DAF wavelet neural nets are presented in this chapter. 

Automatic diagnosis of electrocardiogram (ECG or EKG) signals is an important 
biomedical analysis tool. The diagnosis is based on the detection of abnormalities in an 
ECG signal. ECG signal processing is a crucial step for obtaining a noise-free signal and 
for improving diagnostic accuracy. A typical raw ECG signal is given in Fig. 2. The 
letters P, Q, R, S, T and U label the medically interesting features. For example, in the 
normal sinus rhythm of a 12-lead ECG, a QRS peak follows each P wave. Normal P
waves occur at a rate of 60-100 bpm with <10% variation. Their heights are <2.5 mm and widths
<0.11 s in lead II. A normal PR interval ranges from 0.12 to 0.20 s (3-5 small squares). A
normal QRS complex has a duration of <0.12 s (3 small squares). A corrected QT interval
(QTc) is obtained by dividing the QT interval by the square root of the preceding R-R
interval (normally QTc = 0.42 s). A normal ST segment indicates no elevation or
depression. Hyperkalemia, hyperacute myocardial infarction and left bundle can cause 
an extra tall T wave. Small, flattened or inverted T waves are usually caused by 
ischaemia, age, race, hyperventilation, anxiety, drinking iced water, LVH, drugs (e.g. 
digoxin), pericarditis, PE, intraventricular conduction delay (e.g. RBBB) and electrolyte 
disturbance [12]. 
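
The QT correction described above (QT divided by the square root of the preceding R-R interval) can be written as a one-line function. This is a minimal sketch; the function name and the choice of seconds as the unit are assumptions.

```python
import math

def corrected_qt(qt_seconds, rr_seconds):
    """QT interval divided by the square root of the preceding R-R interval."""
    if rr_seconds <= 0:
        raise ValueError("R-R interval must be positive")
    return qt_seconds / math.sqrt(rr_seconds)
```

For instance, a QT interval of 0.36 s following an R-R interval of 0.64 s gives a corrected value of about 0.45 s.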

An important task of ECG signal filtering is to preserve the true magnitudes of
the P, Q, R, S, T, and U waves, protect the true intervals (starting and ending points) and
segments, and suppress distortions induced by noise. The most common noise in an ECG
signal is AC interference (about 50-60 Hz in the frequency regime). Traditional
filtering methods (low-pass and band-elimination filters, etc.) encounter difficulties in
dealing with the AC noise because the signal and the noise overlap in the same band. As a
consequence, experienced doctors are required to carry out time-consuming manual
diagnoses.

Based on a time varying processing principle, a non-linear filter [4] was recently 
adopted for ECG signal de-noising. Similar to the selective averaging schemes used in 
image processing, the ECG is divided into different time segments. A sample point 
classified as "signal" is smoothed by using short window averaging, while a "noise"
point is treated by using long window averaging. Window width is chosen according to 
the statistical mean and variance of each segment. However, this calculation is 
complicated and it is not easy to select windows with appropriate lengths. The 
regularized spline network and wavelet packets were later used for adaptive ECG 
filtering [5, 6], which is not yet efficient and robust for signal processing. In our present 
treatment, regularized DAF networks are used to handle a real-valued ECG signal. We 
utilize our combined group-constraint technique to enhance signal components and 
suppress noise in successive time-varying tilings. 

The raw ECG of a patient is given in Fig. 3(a). Note that it has typical thorn-like 
electromagnetic interference. Fig. 3(b) is the result of a low-pass filter smoothing. The 
magnitudes of the P and R waves are significantly reduced and the Q and S-waves almost 
disappear completely. The T wave is enlarged, which leads to an increase in the QT 
interval. Notably, the ST segment is depressed. Such a low-pass filtering result can cause 
significant diagnostic errors. Fig. 3(c) is obtained by using our DAF wavelet neural nets. 
Obviously, our method provides better feature-preserving filtering for ECG signal 
processing. 

Another application is for electromyography (EMG) filtering. Surface EMG has
been used to evaluate muscle activation patterns in patients with gait disorders since the
mid 1900s. In experimental as well as routine recording of muscle action potentials,
signal cross-talk from various sources cannot always be avoided [2]. In particular, in EMG
investigations within the areas of physical science, orthopedics or ergonomics, where the
collection of data has to be carried out under field conditions, the measured signals are
often incorrect due to movement of the subject. In particular, DC offset voltages,
movement of electrodes and cables, 50 Hz interference and electrostatic interference
should all be considered. But even with the utmost care, movement artifacts, particularly
in studies of movement, cannot be completely avoided. Thus for a number of quantitative
signal processing procedures, an elimination of interference has to be carried out.

Once the raw scanning EMG data are stored in the computer, several processing
options are available for improving the signal quality. The first is removing the DC
offset per recorded trace for elimination of movement artifacts. The second option is a
moving window smoothing (three points or more) in the time direction of each trace for
noise reduction. In the third, a nonlinear median filtering over 3, 5 or 7 points is employed
in the depth direction for elimination of non-time-locked activity. The number of
median-filtering iterations depends on the quality of the recorded scan [2].
Although a median filter is better for impulse-like noise removal than linear filters (low-
pass, high-pass or band-limited) [11], it is not optimal since it is typically implemented
uniformly across the image. The median filter suppresses both the noise and the true
signal in many applications. It results in amplitude reduction of the sharp signal peak,
which is deleterious for diagnostic analysis.
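
The difference between the moving-window average and the median filter, and the reason a uniformly applied median filter can also remove a genuine sharp peak, can be seen on a toy trace. The sketch below is illustrative only; the three-point window, the edge padding, and the synthetic trace are assumptions.

```python
import numpy as np

def moving_average(x, width=3):
    """Moving-window mean with a window of `width` samples."""
    kernel = np.ones(width) / width
    return np.convolve(x, kernel, mode="same")

def median_filter(x, width=3):
    """Running median with edge padding, window of `width` samples."""
    half = width // 2
    padded = np.pad(x, half, mode="edge")
    return np.array([np.median(padded[i:i + width]) for i in range(len(x))])

trace = np.array([0.0, 0.0, 0.0, 5.0, 0.0, 0.0, 0.0])   # single sharp peak
avg = moving_average(trace)
med = median_filter(trace)
```

The moving average spreads the peak over three samples with one third of its height, whereas the median filter removes it entirely. That is the desired behavior for an impulse-like artifact, but it is equally destructive when the one-sample peak is part of the true signal.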

In this paper, we use a DAF wavelet neural network for the adaptive EMG
processing. The sampling is irregular to match the time-varying characteristics of EMG.
Additional regularization and robust designs enable the optimal smooth approximation
of the signal. As shown in Fig. 4, the original measured EMG has many thorn-like noise
peaks (Fig. 4(a)). A simple low-pass filtered result is shown in Fig. 4(b). Our newly
developed technique results in the solution shown in Fig. 4(c). Again, one should note the
preservation of feature details achieved.

CONCLUSION 

Regularized DAF wavelet neural networks are proposed for non-stationary 
biomedical signal processing. The DAF approximation shells possess infinite smoothness 
in both physical and frequency domains, which enable the high-resolution time-varying 
analysis of the signal. The optimal signal filtering solution is obtained using a 
combination of several different contributions to a "cost function". Measured ECG and
EMG signals are employed for testing the new technique. The simulations show that our
method is both efficient and robust for time-varying filtering.

PERCEPTUAL NORMALIZED SUBBAND IMAGE RESTORATION



INTRODUCTION 

An earlier group normalization (GN) technique [1] is extended to account for human
perceptual responses. The purpose of the GN is to re-scale the magnitudes of various
subband filters and obtain normalized equivalent decomposition filters (EDFs). Our
human visual system can be regarded as a natural signal and image processing device, a
non-ideal band-pass filter. In order to achieve the best noise-removing efficiency, human
vision response [2] is accounted for by a perceptual normalization (PN). These
approaches are combined with our biorthogonal interpolating filters [3] to achieve
excellent perceptual image restoration and de-noising.

The biorthogonal interpolating filters are constructed by Sweldens' lifting scheme
[4] using Gaussian-Lagrange distributed approximating functionals (GLDAFs) [5], which
were introduced as an efficient and powerful grid method for numerically solving partial
differential equations with extremely high accuracy and computational efficiency. From
the point of view of wavelet analysis, GLDAFs can be regarded as smooth low-pass
filters.

PERCEPTUAL NORMALIZATION 

The main objective of wavelet analyses is to provide an efficient L²(R)
representation of a function. For signal and image processing, it is important to preserve
the signal components of the corresponding subband filters. Wavelet coefficients can be
regarded as the output of the signal passing through the equivalent decomposition filters
(EDFs).

The responses of the EDFs are the combination of several recurrent subband
filters at different stages. As shown in Fig. 1, the EDF amplitudes of our DAF-wavelets
in each of the sub-blocks are different. Thus the magnitude of the decomposition
coefficients in each of the sub-blocks cannot exactly reproduce the actual strength of the
signal components. To adjust the magnitude of the response in each block, the
decomposition coefficients are re-scaled with respect to a common magnitude standard.
Thus EDF coefficients, C_m(k), in block m should be multiplied with a magnitude scaling
factor, λ_m, to obtain an adjusted magnitude representation [1]. This idea was recently
extended to Group Normalization (GN) of wavelet packets for signal processing [1].
The main objective of wavelet signal filtering is to preserve important signal 
components, and efficiently reduce noisy components. To achieve this goal, we utilize 
the human vision response to different frequency bands. An image can be regarded as the 
impulse response of the human visual system. The latter essentially consists of a series 
of "subband filters". It is interesting to note that, just like the non-uniform response of 
wavelet subband EDFs, the human "subband filters" also have non-uniform responses
to a real object. Using a just-noticeable perceptual matrix, we can efficiently remove the
visual redundancy from decomposition coefficients and normalize them with respect to
the standard of perception importance. A mathematical model for perception efficiency
has been presented by A. B. Watson et al. [2] and can be used to construct the "perceptual
lossless" quantization matrix Y_m for normalizing visual response [2]. This treatment
provides a simple human-vision-based threshold technique [6] for the restoration of the
most important perceptual information in an image. For gray-scale image processing, the 
luminance (magnitude) of the image pixels is what we are most concerned with. The 
combination of the above mentioned group and visual response normalization can be 
called the Visual Group Normalization (VGN) of wavelet coefficients. 

BIORTHOGONAL INTERPOLATING DAF WAVELETS 

One of the most important aspects of wavelet transforms is the wavelets themselves.
From the point of view of applications, the stability, regularity, time-frequency 
localization and computational efficiency are the most important criteria for selecting 
wavelets. Interpolating wavelets are particularly efficient for basis set construction since 
their multiresolution spaces are identical to the discrete sampling spaces. In other words, 
there is no need for one to compute wavelet expansion coefficients by the usual inner 
products. This makes it easy to generate subband filters in a biorthogonal setting without 
requiring tedious iterations. Moreover, adaptive boundary treatments and non-uniform 
samplings can be easily implemented using interpolating methods. The lifting scheme
discussed by Sweldens [4] is used in this work for constructing interpolating filters.
We use the interpolating Gaussian-Lagrange DAF (GLDAF) [5]

$$\phi_M(x) = W_\sigma(x)\,P_M(x)$$

as a scaling function for our wavelet construction. Here W_σ(x) is a window function and is
selected as a Gaussian because it satisfies the minimum frame bound condition in
quantum physics. The quantity σ is a window width parameter, and P_M(x) is the usual
Lagrange interpolation kernel. Biorthogonal dual DAF scaling functions and DAF-
wavelets are constructed using Sweldens' lifting scheme [4]. The present DAF-wavelets
are extremely smooth and rapidly decay in both time and frequency domains. As a
consequence, they are free of Gibbs oscillations, which plague most conventional wavelet
bases. As plotted in Fig. 2, our scaling functions and wavelets display excellent
smoothness and rapid decay.

DEMONSTRATIONS 

We use as benchmarks the 512x512 Y-component Lena and Barbara images to 
demonstrate the restoration efficiency of the present approach. The so-called "Lena" 
image possesses clear sharp edges, strong contrast and brightness. The high texture
component and consequently high frequency edges in the Barbara image create
considerable difficulties for many commonly used filtering techniques. A simple low- 
pass filter will not only smooth out the high frequency noise but also blur the image 
edges, while a simple high-pass filter can preserve the texture edges but will also cause 
additional distortion. 

As shown in Fig. 3(a) and Fig. 4(a), the original Lena and Barbara images are 
each degraded by adding Gaussian random noise. The median filtering (with a 3x3 
window) result is shown in Fig. 3(b) and Fig. 4(b), which is edge-blurred with low visual 
quality. It is evident that our perceptual DAF wavelet technique (Fig. 3(c) and Fig. 4(c)) 
yields better edge-preservation and provides high visual restoration performance. 
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QUINCUNX INTERPOLATING 2D AND 3D WAVELETS-DAF 



INTRODUCTION 



Wavelets have become an important area of research in image processing; however,
most of the work done to date has concentrated on the one-dimensional case, while the
multi-dimensional case has been approached using the tensor product, also known as the
separable wavelet transform. In this work we discuss a way of constructing "non-separable"
wavelets in two and three dimensions using distributed approximating functionals
(DAFs). A recent work [daf004], published in 1998, proposes a way of constructing
wavelets in one dimension called wavelets-DAFs, and shows that this kind of wavelet
improves the accuracy of solutions of linear and nonlinear partial differential equations.
Using a non-separable sampling, known as Quincunx for the 2D case and FCO for the 3D
case (or Quincunx in 3D), gives a representation better matched to the human visual
system. The first section of this paper explains the Quincunx sampling in 2D and 3D
(FCO). The second section addresses the construction of the scaling function. The third
section explains how the actual values of the interpolating multidimensional filter are
obtained.

NONSEPARABLE LATTICES

The quincunx lattice is considered the simplest way of sampling in two
dimensions. In the 2D case we need two vectors to define the sampling; one possible
choice for those vectors is

$$ d_1 = \begin{pmatrix} 1 \\ 1 \end{pmatrix} \quad \text{and} \quad d_2 = \begin{pmatrix} 1 \\ -1 \end{pmatrix}, $$

which lead to the following dilation matrix

$$ D = \begin{pmatrix} 1 & 1 \\ 1 & -1 \end{pmatrix}. \qquad (1) $$



In the Quincunx lattice, the subsampling process rotates the image by 45° and flips the 
image about the horizontal axis. 
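The checkerboard structure of quincunx subsampling can be checked numerically. The following Python sketch assumes the commonly used quincunx dilation matrix whose columns are (1,1) and (1,-1); that matrix, the function names and the 8x8 window are illustrative assumptions, not values taken from this disclosure.

```python
import numpy as np

# Assumed quincunx dilation matrix (its columns are the sampling vectors).
D = np.array([[1, 1],
              [1, -1]])

def lattice_points(dilation, n):
    """Return the lattice points D @ k for all integer vectors k in [-n, n]^2."""
    ks = np.array([(a, b) for a in range(-n, n + 1) for b in range(-n, n + 1)])
    return ks @ dilation.T

def quincunx_mask(height, width):
    """Boolean mask of the pixels kept by quincunx subsampling (checkerboard)."""
    rows, cols = np.indices((height, width))
    return (rows + cols) % 2 == 0

points = lattice_points(D, 3)
sampling_density = abs(round(np.linalg.det(D)))  # number of cosets, M
kept = quincunx_mask(8, 8)
```

Every generated lattice point has an even coordinate sum, which is the checkerboard pattern, and the mask keeps one pixel out of every M = 2.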

As shown in figure 1, the quincunx lattice has a checkerboard pattern. The FCO
lattice is used for the 3D case; it also belongs to the general class of checkerboard
lattices. One of the possible dilation matrices for the FCO lattice is

$$ D = \begin{pmatrix} 1 & 0 & 1 \\ 1 & 1 & 0 \\ 0 & 1 & 1 \end{pmatrix}. \qquad (2) $$

Notice that |det(D)| = M = 2 for either the quincunx or the FCO lattice.

Scaling function

Because the scaling function behaves as a generalized delta sequence,
we use the DAF approach to generate it. The two most important aspects in the design
of the scaling function are the decay rate and the smoothness. We studied the following
functions and their properties. First, the normal function in 2D, defined as follows,

$$ g(x,y) = e^{-(x^2+y^2)/(2\sigma^2)}, \qquad (3) $$

which obviously has a Gaussian decay; its sigma defines the width of the
neighborhood of points. Another function, considered because of its properties under the
Fourier transform, is the sinc function, defined as follows

$$ \mathrm{sinc}\left(\alpha\sqrt{x^2+y^2}\right) = \frac{\sin\left(\alpha\pi\sqrt{x^2+y^2}\right)}{\alpha\pi\sqrt{x^2+y^2}}. \qquad (4) $$

The combination of these functions produces the father wavelets, which are both smooth
and rapidly decaying:

$$ \phi(x,y) = \frac{\sin\left(\alpha\pi\sqrt{x^2+y^2}\right)}{\alpha\pi\sqrt{x^2+y^2}} \, e^{-(x^2+y^2)/(2\sigma^2)}. \qquad (5) $$

The next section explains the domain of x and y for equation 5. Figure 2 shows the scaling
function for the quincunx lattice when α=σ=1. Equation 5 is clearly a nonseparable
function.



CONSTRUCTION OF THE QUINCUNX INTERPOLATING FILTER 

We want to interpolate the red dot shown in figure 3, whose coordinates are
(1/2,1/2). The xy plane is the one that we obtain after we downsample the original lattice
using quincunx. This plane is therefore rotated by 45° with respect to the original lattice
and flipped about the horizontal axis.

The circles in figure 3 represent the known values of the function. Those values
will be used to approximate the value of the function at the red point (1/2,1/2), called P.
The points are labeled according to their distance from P. For instance, all
points labeled "1" are at the same distance from P, (2)^(1/2). Those points represent the filter
taps. There has already been some work on calculating the filter taps for the quincunx
lattice in 2D and 3D; however, those filter taps are fixed for the point P at (1/2,1/2). In our work,
we recalculate the filters depending on where in the grid the new point P lies. The known
values shown in figure 3 are fixed, but their labels are not. For instance, if our point P is
located at (1/3,1/3), the values of the filter taps in the grid will change as shown in figure
4.

This differs from other work in this area. We calculate the
value of those filter taps using equation 5, where x and y are the differences between the
points on the grid and the point P. The sum of all the values of the filter taps must
be 1. When we have only four points to approximate P and P is located at
(1/2,1/2), the result is trivial: the four taps are all labeled "1" and therefore must have
the same value. Since their sum must be equal to one, we find that those taps are
T(1)=1/4. However, when we have more points to approximate P, the result is not as trivial.
Using the DAF theory, we have to look for the optimal sigma and alpha to estimate those
filter taps. We could even consider different values of sigma depending on which points
on the grid we are using to predict the value of the function at P.

The selection of the scaling function was crucial with regard to computational
time. Moreover, since we use DAFs to calculate the filter taps, equation 5 allows us to
interpolate the nth derivative of the function at the point P.
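The tap calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the scaling function is taken to be a radial sinc multiplied by a Gaussian (one reading of equation 5), and the function names, the default α=σ=1 and the four-point neighborhood are hypothetical choices made for the example.

```python
import numpy as np

def scaling_function(x, y, alpha=1.0, sigma=1.0):
    """Radial sinc times Gaussian (assumed form of equation 5)."""
    r = np.hypot(x, y)
    # np.sinc(t) = sin(pi t) / (pi t), with the t = 0 limit handled.
    return np.sinc(alpha * r) * np.exp(-r**2 / (2.0 * sigma**2))

def filter_taps(grid_points, p, alpha=1.0, sigma=1.0):
    """Evaluate the scaling function at (grid point - P) and normalize to sum 1."""
    pts = np.asarray(grid_points, dtype=float)
    weights = scaling_function(pts[:, 0] - p[0], pts[:, 1] - p[1], alpha, sigma)
    # Normalization assumes the raw weights do not sum to zero.
    return weights / weights.sum()

corners = [(0, 0), (1, 0), (0, 1), (1, 1)]
taps_center = filter_taps(corners, (0.5, 0.5))
taps_shifted = filter_taps(corners, (1 / 3, 1 / 3))
```

For P at (1/2,1/2) the four taps come out equal to 1/4, as in the trivial case discussed in the text; moving P to (1/3,1/3) shifts weight toward the nearest grid point while the taps still sum to 1.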
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DAF APPROACH TO IMAGE RESTORATION 
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INTRODUCTION 

Anisotropic diffusion using nonlinear partial differential equations (PDEs) was 
proposed by Perona and Malik [1] for removal of noise, edge detection, and image 
enhancement. Considerable theoretical and numerical analyses and improvements have 
been carried out since its proposal [1-12]. In [1], the restored image is obtained by
evolving the original noisy image u_0(x,y) according to the following diffusion equation
with a variable diffusion coefficient

$$ \frac{\partial u(x,y,t)}{\partial t} = \mathrm{div}\left[ c(|\nabla u(x,y,t)|) \, \nabla u(x,y,t) \right], \qquad (1) $$

where the initial condition is u(x,y,0)=u_0(x,y), div denotes the divergence
operator, and c(|∇u|) is a nonincreasing positive diffusion coefficient function. It is
of interest to note that when c is a constant C, (1) reduces to the heat conduction or isotropic
diffusion equation, as follows

$$ \frac{\partial u(x,y,t)}{\partial t} = C \, \nabla^2 u(x,y,t). \qquad (2) $$



This equation has been used to identify objects in image processing [13]. However,
using isotropic diffusion for image restoration results in a severe edge-blurring
problem, which is the principal reason for introducing anisotropic diffusion. The basic
idea of anisotropic diffusion [1] is as follows: for a region with small gradient |∇u|, the diffusion
coefficient c will be large to encourage smoothing of this region; for a region with large |∇u|, c
will be small to discourage diffusion and thereby preserve the edges. In [1],
Perona and Malik suggested the following two expressions for the diffusion coefficient,

$$ c(|\nabla u|) = \exp\left[ -\left( \frac{|\nabla u|}{k} \right)^2 \right] \qquad (3) $$

and

$$ c(|\nabla u|) = \frac{1}{1 + \left( \frac{|\nabla u|}{k} \right)^2}, \qquad (4) $$

where k is a constant used to provide a threshold for noise removal, i.e., to enhance the 
edges and smooth the homogeneous areas. However, as pointed out by many authors 
[2,4,6,8], such a procedure (1) with diffusion coefficients given by (3) or (4) is ill posed. 
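As a brief illustration, the two Perona-Malik coefficients and the associated flux |∇u|c(|∇u|) can be written in Python as below. The function names and the threshold k=10 are assumptions made for the example; the sketch only evaluates the formulas and is not the restoration algorithm itself.

```python
import numpy as np

def c_exponential(grad_mag, k):
    """Exponential Perona-Malik coefficient, exp[-(|grad u| / k)^2]."""
    return np.exp(-(np.asarray(grad_mag, dtype=float) / k) ** 2)

def c_rational(grad_mag, k):
    """Rational Perona-Malik coefficient, 1 / (1 + (|grad u| / k)^2)."""
    return 1.0 / (1.0 + (np.asarray(grad_mag, dtype=float) / k) ** 2)

def flux(grad_mag, k, coefficient):
    """Flux magnitude |grad u| * c(|grad u|)."""
    return grad_mag * coefficient(grad_mag, k)

k = 10.0
flux_at_half_k = flux(0.5 * k, k, c_rational)
flux_at_k = flux(k, k, c_rational)
flux_at_two_k = flux(2.0 * k, k, c_rational)
```

Both coefficients equal 1 for a vanishing gradient and decay for large gradients. For the rational form the flux rises up to |∇u| = k and falls beyond it, so it is not monotonic in the gradient magnitude.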
When the image is corrupted with Gaussian noise, the gradient
produced by noise can be comparable to that produced by the edges. Application of the
anisotropic diffusion equation to such an image will tend to preserve both the image
edges and the undesirable noisy edges simultaneously. Moreover, in order for the
solution of the diffusion equation to be unique, |∇u|c(|∇u|) must be nondecreasing.
Otherwise, the solution may diverge in time for some choices of numerical scheme. The
same image with slightly different amounts of noise may lead to very different solutions
using the same numerical scheme. The solutions of different numerical schemes for the
same image with the same amount of noise may also differ considerably. To alleviate
the ill-posed problem, it is crucial to detect the edges as accurately as possible, but the
edges of an image are usually unknown to us. For this purpose, a selective smoothing of
the diffusion coefficient in (1) was introduced in [2], as follows:

$$ \frac{\partial u(x,y,t)}{\partial t} = \mathrm{div}\left\{ c\left[ |\nabla (g(x,y) * u(x,y,t))| \right] \nabla u(x,y,t) \right\}, \qquad (5) $$

where g(x,y) is a Gaussian filter used to detect the edges and g(x,y)*u(x,y,t) denotes the
convolution operation at a given time t. This smoothing operation is not limited to
the use of a Gaussian filter. In [9], a symmetric exponential filter was proposed, and
was claimed to be more accurate for detecting the position of edges. It is given by

$$ g(x,y) = \frac{\beta}{2} \exp\left[ -\beta (|x| + |y|) \right]. \qquad (6) $$

Using this smoothing filter, significant improvement in the quality of the restored images
was attained [9]. However, equation (5) has the serious drawback that the convolution
must be computed at each time step [8].

We disclose using a recently developed distributed approximating functional
(DAF) approach [15-17] to solve the anisotropic diffusion problem. The DAFs used here
have several advantages. First, they can be used to spatially discretize the diffusion
equation. Previous applications of the DAFs to nonlinear partial differential equations
(PDEs) have shown that they are very accurate and stable PDE solvers [18-19]. Second,
by proper choice of the parameters, the DAF evaluation of |∇u| in the diffusion
coefficient automatically produces smoothing. Third, a periodic boundary condition is
added to the image, using DAFs according to a recently developed algorithm [20,22],
making it possible to utilize DAFs and many other numerical schemes which require a
knowledge of the boundary conditions. Fourth, the DAF can be used as an approximate
ideal low pass filter to remove the high frequency noise before performing the time
evolution. The success of DAFs applied to the anisotropic diffusion equation is due to
their so-called "well-tempered" property, which is the most important feature in the DAF
theory [17].

DISTRIBUTED APPROXIMATING FUNCTIONAL FORMALISM 

The distributed approximating functionals (DAFs) were introduced as
"approximate identity kernels" used to approximate a function and its various derivatives, 
in terms of a discrete sampling [15-17]. One of the most important properties of one class 
of commonly used DAFs is the so-called "well-tempered" approximation, which 
distinguishes them from many other numerical schemes. The "well-tempered" DAFs are 
constructed such that there are no "special points" in the DAF approximation; i.e., the
approximation to a function has a similar order of accuracy both on and off the input grid 
points. The DAFs also provide a "well-tempered" approximation to the derivatives. As 
long as the function and its derivatives remain in the "DAF-class", the "well-tempered" 
property will ensure that they have similar accuracy. In contrast, many other numerical 
schemes (e.g., basis expansions, wavelets, splines, finite differences, finite elements,
etc.), yield exact results for the function on the grid points, but typically at the expense
of giving significantly worse results for the function off the grid points. Such interpolative
methods are always less accurate for the derivatives of the function. The drawback of the 
interpolative methods is especially distinct when the function is contaminated by noise, 
since the noise in the signal input data will be kept unchanged. In this case, a filter must 
be used to eliminate the noise in advance. Another feature of certain DAFs is that since 
they are approximate integral identity kernels, they yield an integral representation of 
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differential operators. By proper choice of the parameters, the DAFs provide a 
controllably accurate analytical representation of derivatives on and off the grid points, 
which is crucial to their success in many applications. One of the most useful
realizations of the "well-tempered" DAFs is the Hermite DAF, given by 

$$ \delta_M(x-x'|\sigma) = \frac{1}{\sigma} \exp(-z^2) \sum_{n=0}^{M/2} \left( -\frac{1}{4} \right)^n \frac{1}{\sqrt{2\pi}\, n!} H_{2n}(z), \qquad (7) $$

where z=(x-x')/(√2 σ), H_{2n}(z) is the usual (even order) Hermite polynomial, and σ and
M are the DAF parameters. The HDAF is dominated by its Gaussian factor exp(-z^2),
which serves to control the effective width of the function. In the limit of either M → ∞
or σ → 0, the Hermite DAF becomes identical to the Dirac delta function. A function f(x)
is approximated by the Hermite DAF according to



$$ f(x) \approx f_{DAF}(x) = \int_{-\infty}^{\infty} \delta_M(x-x'|\sigma) f(x')\,dx'. \qquad (8) $$

For function sampling on a discrete, uniform grid, the Hermite DAF approximation to
the function at any point x (on or off the grid point) is given by

$$ f_{DAF}(x) = \Delta \sum_k \delta_M(x-x_k|\sigma) f(x_k), \qquad (9) $$

where Δ is the uniform grid spacing. Only terms in the sum from grid points sufficiently
close to x contribute significantly.

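Approximations for various linear transformations of a function can also be
generated using the Hermite DAF. Especially important cases are derivatives to various
orders, given by

$$ f^{(l)}(x) \approx f^{(l)}_{DAF}(x) = \int_{-\infty}^{\infty} \delta_M^{(l)}(x-x'|\sigma) f(x')\,dx', \qquad (10) $$

where $\delta_M^{(l)}(x-x'|\sigma)$ is the lth derivative of $\delta_M(x-x'|\sigma)$, and

$$ \delta_M^{(l)}(x-x'|\sigma) = \frac{2^{-l/2}}{\sigma^{l+1}} \exp(-z^2) \sum_{n=0}^{M/2} \left( -\frac{1}{4} \right)^n (-1)^l \frac{1}{\sqrt{2\pi}\, n!} H_{2n+l}(z). \qquad (11) $$

These expressions can be discretized by quadrature,

$$ f^{(l)}_{DAF}(x) = \Delta \sum_k \delta_M^{(l)}(x-x_k|\sigma) f(x_k). \qquad (12) $$

Equations (11) and (12) provide convenient methods for calculating derivatives in solving
nonlinear partial differential equations (NPDEs).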
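A minimal Python sketch of the discrete Hermite DAF approximation and its derivatives is given below. It assumes the standard HDAF form, with z=(x-x')/(√2 σ) and a sum over even-order Hermite polynomials, and evaluates it through normalized Hermite functions, which is an algebraically equivalent rearrangement chosen here only to avoid overflow for large M. The function names, the test signal sin(x) and the grid are illustrative assumptions.

```python
import math
import numpy as np

def hdaf_kernel(x, sigma, M, l=0):
    """l-th derivative of the Hermite DAF kernel; l = 0 gives the kernel itself."""
    z = np.atleast_1d(np.asarray(x, dtype=float)) / (math.sqrt(2.0) * sigma)
    # Normalized Hermite functions psi_k(z) from the stable three-term recursion.
    psi_prev = np.zeros_like(z)                       # psi_{-1}
    psi_curr = np.exp(-0.5 * z**2) / math.pi**0.25    # psi_0
    total = np.zeros_like(z)
    for k in range(M + l + 1):
        n, rem = divmod(k - l, 2)
        if k >= l and rem == 0:
            # log of (1/4)^n / n! * sqrt(2^k k! sqrt(pi)), which converts
            # H_k(z) exp(-z^2 / 2) into psi_k(z).
            log_c = (0.5 * (k * math.log(2.0) + math.lgamma(k + 1)
                            + 0.5 * math.log(math.pi))
                     - n * math.log(4.0) - math.lgamma(n + 1))
            total += (-1.0) ** n * math.exp(log_c) * psi_curr
        psi_next = (math.sqrt(2.0 / (k + 1)) * z * psi_curr
                    - math.sqrt(k / (k + 1)) * psi_prev)
        psi_prev, psi_curr = psi_curr, psi_next
    prefactor = ((-1.0) ** l * 2.0 ** (-0.5 * l)
                 / (sigma ** (l + 1) * math.sqrt(2.0 * math.pi)))
    return prefactor * np.exp(-0.5 * z**2) * total

def hdaf_apply(x_eval, x_grid, f_grid, sigma, M, l=0):
    """Quadrature form: dx * sum_k kernel^(l)(x - x_k) f(x_k) on a uniform grid."""
    x_grid = np.asarray(x_grid, dtype=float)
    dx = x_grid[1] - x_grid[0]
    diff = np.asarray(x_eval, dtype=float)[:, None] - x_grid[None, :]
    return dx * hdaf_kernel(diff, sigma, M, l) @ np.asarray(f_grid, dtype=float)

dx = 0.1
x_grid = np.arange(0.0, 20.0, dx)
x_eval = np.array([9.95, 10.03, 10.5])   # interior points, on and off the grid
f_approx = hdaf_apply(x_eval, x_grid, np.sin(x_grid), sigma=2.36 * dx, M=30)
df_approx = hdaf_apply(x_eval, x_grid, np.sin(x_grid), sigma=2.36 * dx, M=30, l=1)
```

With σ/Δ=2.36 and M=30, the smooth test signal and its first derivative are reproduced at both on-grid and off-grid interior points, which illustrates the absence of "special points". Points near the ends of the grid are not handled in this sketch.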

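The HDAF approximation can be easily extended to the multi-dimensional case as
a direct product. For a two-dimensional function f(x,y), one has

$$ f_{DAF}(x,y) = \Delta_x \Delta_y \sum_{k_1} \delta_{M_x}(x - x_{k_1}|\sigma_x) \sum_{k_2} \delta_{M_y}(y - y_{k_2}|\sigma_y)\, f(x_{k_1}, y_{k_2}) \qquad (13) $$

and

$$ f^{(l,m)}_{DAF}(x,y) = \Delta_x \Delta_y \sum_{k_1} \delta^{(l)}_{M_x}(x - x_{k_1}|\sigma_x) \sum_{k_2} \delta^{(m)}_{M_y}(y - y_{k_2}|\sigma_y)\, f(x_{k_1}, y_{k_2}), \qquad (14) $$

where $f^{(l,m)}_{DAF}(x,y)$ denotes the lth partial derivative with respect to x and the mth
partial derivative with respect to y of the function.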

The Hermite DAF has the so-called "well-tempered" property [17], and it is a low
pass filter. In Fig. 1, we show plots of Hermite DAFs, obtained using two different sets
of parameters, in (a) coordinate space and (b) frequency space, respectively. Their first
order derivatives are plotted in Fig. 1(c), in coordinate space. The solid line is for
σ=3.54, M=120 and the dashed line is for σ=2.36, M=30. The solid line is said to be in
the less "well-tempered" or more "interpolative" regime, and the dashed line is in the
more "well-tempered" (smoothing) regime. In contrast to the sinc function sin(ωx)/(πx),
for which ω=π/Δ gives the ideal interpolator and ω<π/Δ gives a
low pass filter, the Hermite DAF is highly banded in coordinate space due to the
presence of the Gaussian factor, which means that only a small number of values are
required on each side of the point x in (9), as is also true for the derivatives of the
Hermite DAF. This is clearly seen in Figs. 1(a) and 1(c). With proper choice of
parameters, the Hermite DAF is an excellent filter (see the dashed line in Fig. 1). For the
case far away from interpolation, the DAF approximation is not exact on the grid (the
Hermite DAF is not equal to 1 at the origin and not equal to 0 at nΔ, n≠0). The DAF
approximation to the function depends not only on the function value at the grid point
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itself, but also on values at the grid points close to it. The approximation to a function 
has about the same order of accuracy both on and off the grid points. This 
"well-tempered" property of DAFs plays an important role in our periodic extension 
algorithm. Application of the "well-tempered" DAF to approximate the function and its 
derivatives is relatively insensitive to the presence of noise. As will be seen in this paper, 
the "well-tempered" Hermite DAF serves as a very good smoothing filter to accurately 
detect the image edges. Application of the "well-tempered" Hermite DAF to the variable 
of the diffusion coefficient in the anisotropic diffusion equation (1) avoids the need for 
an additional smoothing operation. The Hermite DAF will perform the discretization and 
the smoothing operation on the function simultaneously. Since the "well-tempered" 
Hermite DAF is also a low pass filter, by proper choice of the DAF parameters, (13) can 
be used to eliminate the high frequency noise of a signal once the boundary condition 
is known. However, the boundary condition is usually unknown. In the following 
section, we will give a brief review of how a periodic boundary condition is imposed, 
using the DAFs.

DAF-BASED METHOD OF PERIODIC EXTENSION 

For a signal of finite length, it is a common requirement of many numerical
schemes that one know the boundary conditions of the signal in order to avoid "aliasing".
Take the noncausal finite impulse response filters (FIRs) as an example. In order to get
the desired filtering results, the noncausal FIRs require a knowledge of the signal values
in the future, which are impossible to obtain in advance. Also, there are many
numerical analysis algorithms for which it is computationally efficient for the signal to
be of a given length, a condition that is not usually satisfied by experimental data. In this
section, we show how our algorithm is employed to impose a periodic boundary condition on the
signal. The algorithm of periodic extension has been discussed in detail in [20] and [22].
We will give a brief description of how it is implemented.

The method is closely related to the DAF extrapolation algorithm presented in
[21]. Assume a limited length of a signal {f_1, f_2, ..., f_{J-1}} is known, and we require it to be
periodic with the period K, so that K-J+1 values {f_J, f_{J+1}, ..., f_K} are left to be determined.
Since now the signal is periodic, the values {f_{K+1}, f_{K+2}, ..., f_{K+J-1}} are also known and equal
to {f_1, f_2, ..., f_{J-1}} respectively. Under the assumption that the signal is in the "DAF-class",
the unknown values from f_J to f_K can be determined by minimizing the following "cost
function", constructed using the "well-tempered" Hermite DAF:

$$ C = \sum_p W_p \left( f(x_p) - f_{DAF}(x_p) \right)^2, \qquad (15) $$

where W_p is a weight related to the grid point x_p and f_{DAF}(x_p) is the DAF
approximation to the function at x_p. The summation range for p is selected according
to the problem considered. From (9) and (15), we have

$$ C = \sum_p W_p \left( f(x_p) - \sum_{t=p-w}^{p+w} \Delta\, \delta_M(x_p - x_t|\sigma) f(x_t) \right)^2, \qquad (16) $$

where w is the DAF half bandwidth. We minimize the "cost function" with respect to
variations of the unknown function values according to

$$ \frac{\partial C}{\partial f(x_l)} = 0, \qquad J \le l \le K. \qquad (17) $$

Equations (16) and (17) generate a set of linear simultaneous algebraic equations, given
by

$$ \sum_{p=l-w}^{l+w} 2 W_p \left( f(x_p) - \sum_{t=p-w}^{p+w} \Delta\, \delta_M(x_p - x_t|\sigma) f(x_t) \right) \left( \delta_{pl} - \Delta\, \delta_M(x_p - x_l|\sigma) \right) = 0, \qquad J \le l \le K, \qquad (18) $$

where δ_{pl} is the Kronecker delta function. These linear algebraic equations can be solved
by standard algorithms [18] to predict values of the function in the gap.
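The minimization described above is a linear least-squares problem in the unknown samples, and it can be sketched in Python as follows. This is an illustration under stated assumptions, not the reference implementation of [20] or [22]: the cost is summed over every point p of one period, the weights default to 1, the kernel is the Hermite DAF evaluated with `numpy.polynomial.hermite.hermval` for a moderate M, and all function names and parameter values are hypothetical.

```python
import math
import numpy as np
from numpy.polynomial.hermite import hermval

def hdaf(x, sigma, M):
    """Hermite DAF kernel (direct evaluation, intended for moderate M)."""
    z = np.asarray(x, dtype=float) / (math.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (math.sqrt(2.0 * math.pi) * math.factorial(n))
    return np.exp(-z**2) * hermval(z, coeffs) / sigma

def periodic_daf_matrix(K, dx, sigma, M, w):
    """A[p, t] = dx * kernel(x_p - x_t) for |p - t| <= w, indices taken modulo K."""
    A = np.zeros((K, K))
    offsets = np.arange(-w, w + 1)
    values = dx * hdaf(offsets * dx, sigma, M)
    for p in range(K):
        for off, val in zip(offsets, values):
            A[p, (p + off) % K] += val
    return A

def cost(f, K, dx, sigma, M, w, weights=None):
    """Weighted sum of squared differences between f and its DAF approximation."""
    W = np.ones(K) if weights is None else np.asarray(weights, dtype=float)
    r = f - periodic_daf_matrix(K, dx, sigma, M, w) @ f
    return float(np.sum(W * r**2))

def periodic_extension(known, K, dx, sigma, M, w, weights=None):
    """Fill samples len(known)..K-1 so that the K-periodic signal minimizes the cost."""
    known = np.asarray(known, dtype=float)
    n_known = known.size                      # J - 1 in the notation of the text
    R = np.eye(K) - periodic_daf_matrix(K, dx, sigma, M, w)   # f - f_DAF
    W = np.ones(K) if weights is None else np.asarray(weights, dtype=float)
    sw = np.sqrt(W)[:, None]
    # R @ f = R_known @ known + R_unknown @ unknown; minimize its weighted norm.
    rhs = -(sw * R[:, :n_known]) @ known
    unknown, *_ = np.linalg.lstsq(sw * R[:, n_known:], rhs, rcond=None)
    return np.concatenate([known, unknown])

x = np.arange(48.0)
known = np.exp(-0.02 * x) * np.cos(0.3 * x)   # a nonperiodic test signal
params = dict(K=64, dx=1.0, sigma=2.36, M=8, w=12)
extended = periodic_extension(known, **params)
zero_fill = np.concatenate([known, np.zeros(16)])
```

Solving with a least-squares routine is equivalent to solving the normal equations obtained by setting the derivatives of the cost to zero. The known samples are left unchanged, and the filled gap gives a cost no larger than any other fill, for example padding with zeros.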

Note that it is the "well-tempered" property of the DAFs that makes the above
algorithm possible. For interpolation algorithms, the value on each grid point is exact and
does not depend on the values of the function at any other grid points, which means
that the "cost function" is always zero for points on the grid. In contrast, the
"well-tempered" DAFs are not required to give an exact representation on the grid points.
The approximate values on a grid point are related to the nearby values.

We must stress that this periodic extension of a nonperiodic signal is fundamentally
different from extrapolation, although the two use similar algorithms. The "pseudo-signal"
in the extended domain is only used as an aid to treat or analyze the original signal in a
way that doesn't significantly disturb the behavior in the known region. In general, one
does not obtain the true signal in this domain. The beauty of the present periodic extension
algorithm is that it provides a boundary condition for the signal without significant
"aliasing", as was shown in [20] and [22]. Once the periodic signal is obtained, it can be
extended as much as desired, depending on the intended numerical application. In
contrast, extrapolation is required to accurately predict the true function values in the
extended domain.

It must be noted that although we have discussed the algorithm in one dimension, 
extending it to two or more dimensions is straightforward. The DAF approximation to 
a function and its derivatives are given in (13) and (14), respectively. However, a direct 
two-dimensional extrapolation is a time and memory consuming procedure because too 
many simultaneous algebraic equations must be solved. One alternative to this is to 
consider the two dimensional patterns as composed of many lines of one-dimensional 
signals, and then extend each line separately. This procedure is generally less accurate 
than the direct two dimensional extrapolation algorithm because it only considers the 
correlation along one direction while neglecting cross correlation. However, for many 
problems, it is sufficiently accurate, and it is a very economical procedure. 

The algorithm presented in this section only refers to periodic extension as an 
example. However, we note that one can also extend the original signal to any other 
kinds of boundary conditions, as needed for different numerical applications. Periodic 
extension is only one case out of many possible ones. 

NUMERICAL EXAMPLES 

Numerical experiments are performed using the "Lena image", shown in Fig. 2.
We degrade it with Gaussian white noise with peak signal-to-noise ratio (PSNR) of
22.14 dB [Fig. 3(a)] and 18.76 dB [Fig. 4(a)] respectively. The PSNR used here is
defined to be

$$ \mathrm{PSNR} = 10 \log_{10} \frac{255 \times 255}{\mathrm{MSE}}, \qquad (19) $$

where MSE is the mean-square-error of the noisy image, defined by

$$ \mathrm{MSE} = \frac{1}{N_x N_y} \sum_{i=1}^{N_x} \sum_{j=1}^{N_y} \left[ S(i,j) - \hat{S}(i,j) \right]^2, \qquad (20) $$

where S(i,j) and $\hat{S}(i,j)$ are the original image and noisy image samples respectively, and
N_x and N_y are the number of pixels horizontally and vertically respectively. For the
purpose of effectively removing the noise in the homogeneous areas while
simultaneously preserving the edges, we choose the following definition of the diffusion
coefficient [8]



c(s) 



a eL£±iZ1L 



(21) 



where A, T, ε and p are adjustable parameters. Here we fix A=100, T=5, ε=1 and p=0.5
in our numerical applications. The anisotropic diffusion equation (1) is spatially
discretized using the Hermite DAFs according to (14). The more "well-tempered"
Hermite DAFs (σ_x/Δ_x=σ_y/Δ_y=2.36, M_x=M_y=30 for the lower noise image and M_x=M_y=22
for the higher noise image) are used to discretize the variable of the diffusion coefficient
to remove more noise and accurately detect the edges. A more accurate Hermite DAF
(σ_x/Δ_x=σ_y/Δ_y=3.54, M_x=M_y=120) is employed to discretize the other part of the spatial
differential operation accurately. The fourth order Runge-Kutta method [23] is employed
to perform the time evolution. Before carrying out the time propagation, we used the
HDAF with σ_x=σ_y=2.36, M_x=M_y=32 to remove the high frequency noise.
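The time evolution step can be illustrated with a generic fourth order Runge-Kutta update in Python. In this sketch the spatial derivatives are computed with central differences from `numpy.gradient` as a simple stand-in for the Hermite DAF discretization, and the rational Perona-Malik coefficient (4) is used in place of coefficient (21); the function names, the threshold k and the time step are assumptions made for the example.

```python
import numpy as np

def diffusion_rhs(u, k=10.0):
    """div(c(|grad u|) grad u) using central differences (stand-in for DAF derivatives)."""
    uy, ux = np.gradient(u)
    c = 1.0 / (1.0 + (np.hypot(ux, uy) / k) ** 2)   # assumed coefficient, form (4)
    return np.gradient(c * uy, axis=0) + np.gradient(c * ux, axis=1)

def rk4_step(rhs, u, dt):
    """One classical fourth order Runge-Kutta step for du/dt = rhs(u)."""
    k1 = rhs(u)
    k2 = rhs(u + 0.5 * dt * k1)
    k3 = rhs(u + 0.5 * dt * k2)
    k4 = rhs(u + dt * k3)
    return u + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

# Sanity check on du/dt = -u, whose exact solution is exp(-t).
decayed = rk4_step(lambda u: -u, np.array([1.0]), 0.1)

# A constant image has zero gradient, so a diffusion step leaves it unchanged.
flat = np.full((16, 16), 128.0)
flat_after = rk4_step(diffusion_rhs, flat, 0.1)
```

Any right-hand side can be passed to `rk4_step`, so a DAF-based evaluation of the divergence term could replace `diffusion_rhs` without changing the time stepping.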

Fig. 3(b) and Fig. 4(b) show the restored Lena images produced by
our numerical algorithm. Comparing them with the degraded images [Fig. 3(a) and Fig.
4(a)] and the original image (Fig. 2), it is clearly seen that the algorithm can effectively
remove noise while preserving the edges simultaneously. The PSNR is increased by
about 8.00 dB for the lower noise image and 9.43 dB for the higher noise image. The
boundary of the image is also effectively preserved, which shows the success of our
periodic extension algorithm. As pointed out in [9], for noisier images, the Perona-Malik
algorithm may produce some noise echoes that are larger than the threshold level and
cannot be removed. However, increasing the threshold level will lead to the
edge-blurring problem. By using the algorithm presented in this paper, we can
effectively avoid the problem by accurately detecting the edges using the
"well-tempered" HDAF. The procedure of using the "well-tempered" Hermite DAF to
detect the edges automatically avoids having to apply an additional smoothing operation
to the variable of the diffusion coefficient. We also tested our algorithm on
even noisier images and found that it can increase the PSNR substantially (e.g., we have
increased the PSNR of the Gaussian noise degraded Lena image from 11.35 dB to 23.10
dB). Computations with the smoothing filter (6) applied to the variable of the diffusion
coefficient (as presented in [9]) show that the "well-tempered" Hermite DAF is better
able to detect and preserve the edges than the exponential filter.
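The PSNR figures quoted above can be computed with a few lines of Python. The sketch assumes 8-bit images with a peak value of 255 and takes the decibel value as 10 log10 of the ratio of the squared peak to the MSE; the function names and the small test arrays are illustrative.

```python
import numpy as np

def mse(original, degraded):
    """Mean-square-error between two images of the same shape."""
    a = np.asarray(original, dtype=float)
    b = np.asarray(degraded, dtype=float)
    return float(np.mean((a - b) ** 2))

def psnr(original, degraded, peak=255.0):
    """Peak signal-to-noise ratio in dB (undefined when the images are identical)."""
    return 10.0 * np.log10(peak * peak / mse(original, degraded))

clean = np.zeros((4, 4))
noisy = np.ones((4, 4))          # every pixel off by one gray level, so MSE = 1
example_psnr = psnr(clean, noisy)
```

A restoration gain is then the difference between the PSNR of the restored image and that of the degraded image, both measured against the original.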

CONCLUSIONS 

New computational schemes are disclosed to restore a noise corrupted image 
based on solving an anisotropic diffusion equation. The distributed approximating 
functional (DAF) approach introduced in this paper has several advantages for 
application to the anisotropic diffusion equation. First, it provides a periodic boundary 
condition to the image. Second, it provides a smoothing operation to the variable of the 
diffusion coefficient. Third, it discretizes the spatial derivatives of the equation as 
accurately as required for a given application. In addition, before performing the time
evolution using the fourth order Runge-Kutta method, the DAF can be used to eliminate
the high frequency noise from the image. Test results on a noisy Lena image show that
the approach is effective for removing the noise and preserving the edges of the image
simultaneously. It can greatly increase the PSNR of the noisy image. The resulting
periodically extended images can be used with many other numerical schemes which
require a knowledge of the signal in an extended domain or prefer a given number of
image samples.
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TABLE I 

Comparative Restoration Results in PSNR for Image Lena
Corrupted with Different Amounts of Gaussian Noise.

Algorithm                 Corrupted image
                          22.14 dB    18.76 dB

Median filter (3 x 3)     27.24 dB    24.72 dB
Median filter (5 x 5)     26.53 dB    25.29 dB
New approach              30.14 dB    28.19 dB
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A VARIATIONAL APPROACH TO THE DIRICHLET-G ABOR WAVELET-DAF 
INTRODUCTION 

Recently we have introduced several new types of distributed approximating functionals
(DAFs) [1-5], and related wavelet bases associated with them [4,6-10]. In the course of
that work, it was observed that DAFs could be generated in several ways. The first
approach [2,3] leads to a systematic way of approximating a given discrete set of input
data with an infinitely smooth function. The most intensively studied DAF of this type
is called the Hermite DAF, or HDAF. The fundamental unit of its construction is a
product of a Hermite polynomial and its generating function, referenced to an origin that
is located at each point, x. A variational method was introduced for deriving such DAFs
[3], and they have been applied to a large number of problems, ranging from solving
various linear and nonlinear partial differential equations (PDEs) to the entire gamut of
signal processing [6,7,11-14]. A distinctive property of the first DAFs is that they do not
interpolate on the input grid points [2,3]. That is, the first type of DAF approximation
to the function at any grid point, x_p, is not exactly equal to the input data value. In place
of the interpolative property, this DAF approach to functional approximation has the
property that there are no "special points". Said another way, such DAFs deliver similar
accuracy for approximating the function either on or off the grid; similarly, the DAF
approximation to a function, sampled discretely, yields an approximation to the
derivatives of the function comparable in accuracy to the function itself. This is strictly
true only for functions belonging to the "DAF-class", which is that set of functions whose
Fourier transform is sufficiently contained under the DAF-window in Fourier space [3].

More recently, we have developed another general type of DAF which does 
interpolate on the grid, but which still can be "tuned" to yield highly accurate derivatives 
for DAF-class functions [4-7,11-14]. The essence of this approach is to modify an
"interpolating shell" (such as that for Lagrange Interpolation [5], etc.) by an appropriate 
weighting function. By far the most attractive choice has been a Gaussian weight 
function, which has the property of "regularizing" the interpolation so that it delivers an 
infinitely smooth approximation to discretely sampled functions [4-7], and the accuracy 
is ensured so long as the function being considered is in the DAF class. Again, these 
have been shown to be enormously robust for the class of PDEs and signal processing 
problems considered earlier [5-7, 11-14]. 
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An alternative way of viewing these DAFs results from observing that continuous DAFs 
constitute two-parameter Dirac delta sequences [4]. That is, they are approximate 
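identity transforms that depend on two adjustable parameters. In the case, e.g., of the
HDAFs, the two parameters are the Gaussian width, σ, and the highest degree
polynomial, M (where M is even),

$$ \delta_M^{HDAF}(x-x'|\sigma) = \frac{1}{\sigma} \exp\left( -\frac{(x-x')^2}{2\sigma^2} \right) \sum_{n=0}^{M/2} \left( -\frac{1}{4} \right)^n \frac{1}{\sqrt{2\pi}\, n!} H_{2n}\left( \frac{x-x'}{\sqrt{2}\sigma} \right), \qquad (1) $$

where it is easily shown that

$$ \lim_{M \to \infty} \delta_M^{HDAF}(x-x'|\sigma) = \delta(x-x') \qquad (2) $$

for any σ>0, and also that

$$ \lim_{\sigma \to 0} \delta_M^{HDAF}(x-x'|\sigma) = \delta(x-x') \qquad (3) $$

for any fixed M. The availability of two independent parameters, either of which can be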
used to generate the identity kernel or Dirac delta function, can be viewed as the source 
of robustness of the DAFs as computational tools [4]. 

Of the recently introduced regularized, interpolating DAFs, a potentially very
useful one is the Dirichlet-Gabor wavelet-DAF (DGWD) \cite{b}. It was constructed
by combining a Gaussian with the Dirichlet kernel for generating the Fourier series of
a function, to give



As with all of the regularized, interpolating DAFs, this product generates a scaling
wavelet that at once is infinitely smooth and rapidly decaying in both physical and
Fourier space [4]. The constant, $C_{M,\sigma}$, was determined by requiring that the zero
frequency Fourier transform, $\hat{\delta}_M^{DGWD}(0|\sigma)$, be normalized to unity, that is

$$ \hat{\phi}(0) = \hat{\delta}_M^{DGWD}(0|\sigma) = \int_{-\infty}^{\infty} dx\, \delta_M^{DGWD}(x|\sigma) = 1. \qquad (5) $$

Then the "father wavelet" basis is generated by translating and scaling, so that [4]

-m x — nb ^ 



A corresponding "mother wavelet" can be defined as 



Because of the constraint on $\hat{\phi}(0)$, Eq. (5), it is verified that ψ(x) is a "small wave", so its
zero frequency transform satisfies

$$ \hat{\psi}(0) = \int_{-\infty}^{\infty} dx\, \psi(x) = 0. \qquad (8) $$

The computational usefulness of the DGWD was shown by several example
applications to the solution of differential equations [5,11-14]. A multiresolution
analysis has been developed based on these wavelets [4,6,7].



In this paper we enquire as to whether these regularized interpolation DAFs can also be 
obtained in a systematic manner from the same variational principle [3] used for the 
noninterpolating DAFs; especially the HDAF [2,3]. We shall see that the DGWD can 
indeed be obtained directly from our variational principle, and the derivation bears a 
similarity to that used for the Hermite DAFs. In the next Section, we give the detailed 
derivation of the DGWD from the variational principle. 

VARIATIONAL PRINCIPLE APPLIED TO THE DIRICHLET DAF

A general construction of the DAF approximation to a function proceeds by first
developing a suitable approximation to the function at every point x in its domain. This
is typically accomplished by making a basis set expansion of the form

$$ f(x'|x) = \sum_j B_j(x'|x)\, b_j(x). \qquad (9) $$

Here f(x'|x) is an approximation to the function f(x') about the point
x, i.e., parameterized by x. The quantity B_j(x'|x) is the jth basis function
for the point x and b_j(x) is the corresponding coefficient of this basis function for
the expansion centered on the point x. The coefficients b_j(x) remain to be
determined as functionals of the known values of f(x). A succinct expression for the DAF
approximation can then be given by

$$ f_{DAF}(x) = f(x|x) \qquad (10) $$

(although, as previously mentioned, other, more general, definitions, e.g. as
parameterized delta sequences, are also possible [4]). To complete the definition one
must specify how the x-dependent coefficients are to be obtained.

There are various ways that the set of coefficients $\{b_{j}(x)\}$ can be determined. Perhaps the
most straightforward is by the technique of "moving least squares". In this approach one
defines a variational function $\lambda(x)$ for the point $x$ of the form

$$\lambda(x) = \sum_{l} \omega(x_{l}-x)\, |f(x_{l}|x) - f(x_{l})|^{2} , \qquad (11)$$

where the summation over $l$ is over all points in the domain of $x$ where the value of the
function is known. (We replace the sum by an integral over all continuous regions of the
domain where the function is known.) The quantity $\omega(x_{l}-x)$ is a weight function of
arbitrary form, restricted only in that it is non-negative. For concreteness we will take
$\omega$ to be of the Gaussian form

$$\omega(x) = e^{-x^{2}/2\sigma^{2}} , \qquad (12)$$

where $\sigma$ is a parameter with units of length. It should be pointed out that, in general, the
form of the weight can also vary as a function of $x$, as can the basis functions themselves
in both type and number. (For example, we could make $\sigma$ vary with $x$, and/or choose
$B_{j}(x'|x)$ from different complete sets for each distinct value of $x$.) We then write

$$f(x_{l}|x) = \sum_{j} B_{j}(x_{l}|x)\, b_{j}(x) \qquad (13)$$

and determine the optimal values of these coefficients at a particular value of $x$ by
minimizing the "cost" function $\lambda(x)$. In general the expansion coefficients can be
complex. In anticipation of this eventuality, we minimize the cost function with respect
to both the coefficients and their complex conjugates to obtain $2N$ equations to solve for
the real and imaginary parts of the $N$ coefficients. This leads to the set of linear equations

$$\sum_{l=-\infty}^{\infty} \omega(x_{l}-x) B^{*}_{j}(x_{l}|x) f(x_{l}) = \sum_{j'} \left\{ \sum_{l=-\infty}^{\infty} \omega(x_{l}-x) B^{*}_{j}(x_{l}|x) B_{j'}(x_{l}|x) \right\} b_{j'} , \qquad (14)$$
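which can be written compactly as

$$A_{j} = \sum_{j'} C_{jj'}\, b_{j'} \qquad (15)$$

by defining

$$A_{j} = \sum_{l=-\infty}^{\infty} \omega(x_{l}-x) B^{*}_{j}(x_{l}|x) f(x_{l}) \qquad (16)$$

and

$$C_{jj'} = \sum_{l=-\infty}^{\infty} \omega(x_{l}-x) B^{*}_{j}(x_{l}|x) B_{j'}(x_{l}|x) . \qquad (17)$$

We immediately have that

$$b_{j} = \sum_{j'} (C^{-1})_{jj'} A_{j'} . \qquad (18)$$

It is important to recall that all of the quantities in this equation are implicit functions
of $x$.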
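The moving least squares step of Eqs. (15)-(18) can be illustrated with a minimal sketch. It assumes, purely for illustration, a real monomial basis $B_{j}(x'|x) = (x'-x)^{j}$ rather than the Fourier basis adopted in the following paragraphs; the function names `gaussian_weight`, `solve_linear` and `mls_value` are hypothetical and not part of the original text.

```python
import math

def gaussian_weight(u, sigma):
    """Gaussian weight of Eq. (12)."""
    return math.exp(-u * u / (2.0 * sigma * sigma))

def solve_linear(matrix, rhs):
    """Solve matrix * b = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    sol = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * sol[c] for c in range(r + 1, n))
        sol[r] = (aug[r][n] - tail) / aug[r][r]
    return sol

def mls_value(x, grid, values, sigma, n_basis=3):
    """Return f(x|x) from a local weighted least-squares fit centered on x.

    Assumed basis (illustrative only): B_j(x'|x) = (x' - x)**j, j = 0..n_basis-1.
    """
    A = [0.0] * n_basis                          # Eq. (16)
    C = [[0.0] * n_basis for _ in range(n_basis)]  # Eq. (17)
    for x_l, f_l in zip(grid, values):
        w = gaussian_weight(x_l - x, sigma)
        basis = [(x_l - x) ** j for j in range(n_basis)]
        for j in range(n_basis):
            A[j] += w * basis[j] * f_l
            for k in range(n_basis):
                C[j][k] += w * basis[j] * basis[k]
    b = solve_linear(C, A)                       # Eq. (18)
    # Only B_0(x|x) = 1 is nonzero at x' = x, so f(x|x) = b_0, cf. Eq. (10).
    return b[0]

grid = [0.1 * i for i in range(-50, 51)]
values = [2.0 + 3.0 * t - t * t for t in grid]
approx = mls_value(0.37, grid, values, sigma=0.5)
```

With a quadratic basis the fit reproduces a quadratic test function off the grid to rounding error, which is a convenient sanity check on the normal equations.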

To proceed we confine our discussion to functions on the real line and represent
$f(x'|x)$, our local approximation to the function centered on the point $x$, as a Fourier series.
The basis functions then are

$$B_{j}(x'|x) = \frac{1}{\sqrt{N}}\, e^{-2\pi i j (x'-x)/N\Delta} , \qquad (19)$$

where $j$ assumes the $N$ values $-(N-1)/2 \le j \le (N-1)/2$. (Note that $j$ takes on integer values
for odd $N$ and half-integer values for even $N$.) Here $\Delta$ is the grid spacing, which is
assumed to be uniform. As a function of $x'$, $B_{j}(x'|x)$ is obviously periodic with a period
domain of $N\Delta$. From Eq. (17) it is seen that $C$ is a kind of overlap matrix for the basis
functions centered at $x$ under the weight function $\omega(x_{l}-x)$, and due to the periodicity of
the basis functions, we can express it solely as a function of $\eta = \mathrm{mod}_{\Delta}(x_{l}-x)$. As we now
show, it is possible to invert the matrix $C(\eta)$ in closed form. However, it is an
approximation to the inverse that, when valid, gives rise to the DAF representation of the
function which is of interest to us here.

It is useful to write the sum in Eq. (16) in the form

$$\sum_{l=-\infty}^{\infty} = \sum_{p=-\infty}^{\infty} \sum_{q=0}^{N-1} , \qquad (20)$$

where $l = Np + q$. Here we have divided the grid into domains, each with $N$ points. The
$p$-sum is over all domains and the $q$-sum is over all points within a given domain. We
take the point of origin (i.e., $p = 0$, $q = 0$) to be the grid point closest to $x$. Then

$$x_{l} - x = (Np + q)\Delta + \eta , \qquad (21)$$

where $-\Delta/2 \le \eta \le \Delta/2$. That is, $x + \eta$ is the grid point closest to $x$. This leads to

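$$C_{jj'} = \frac{1}{N} \sum_{p=-\infty}^{\infty} \sum_{q=0}^{N-1} \omega((Np+q)\Delta+\eta)\, e^{2\pi i (q+\eta/\Delta)(j-j')/N}
= \sum_{q=0}^{N-1} \psi_{j}(q+\eta/\Delta)\, \lambda_{q+\eta/\Delta}\, \psi^{*}_{j'}(q+\eta/\Delta) , \qquad (22)$$

where

$$\lambda_{q+\eta/\Delta} = \sum_{p=-\infty}^{\infty} \omega((Np+q+\eta/\Delta)\Delta) \qquad (23)$$

and

$$\psi_{j}(q+\eta/\Delta) = \frac{1}{\sqrt{N}}\, e^{2\pi i (q+\eta/\Delta) j/N} . \qquad (24)$$

The quantity $\psi_{j}(q+\eta/\Delta)$ can be taken as the $j$th component of an orthonormal basis set
of $N$-vectors indexed on $q$. That is

$$\sum_{j=-(N-1)/2}^{(N-1)/2} \psi^{*}_{j}(q+\eta/\Delta)\, \psi_{j}(q'+\eta/\Delta) = \delta_{qq'} , \qquad (25)$$

which is a standard result from Fourier theory. From this point of view, Eq. (22)
simply gives an expression for the $C_{jj'}$ matrix element of
$C(\eta)$ in its spectral representation. (Here we have indicated
explicitly that the matrix is a function of $\eta$.) From this it follows immediately that

$$(C^{-1})_{jj'} = \sum_{q=0}^{N-1} \psi_{j}(q+\eta/\Delta)\, \lambda^{-1}_{q+\eta/\Delta}\, \psi^{*}_{j'}(q+\eta/\Delta) \qquad (26)$$

and, further, from Eq. (18) that

$$b_{j}(x) = \sum_{j'} (C^{-1})_{jj'} A_{j'}
= \sum_{j'} \sum_{q=0}^{N-1} \psi_{j}(q+\eta/\Delta)\, \lambda^{-1}_{q+\eta/\Delta}\, \psi^{*}_{j'}(q+\eta/\Delta) \sum_{l'=-\infty}^{\infty} \omega(x_{l'}-x)\, \psi_{j'}(q'+\eta/\Delta)\, f(x_{l'}) . \qquad (27)$$

The sum over $j'$ here produces the Kronecker $\delta_{qq'}$, where

$$q' + \eta/\Delta = \mathrm{mod}_{N}\left(\frac{x_{l'}-x}{\Delta}\right) . \qquad (28)$$

Summing over $q$ then leads to

$$b_{j}(x) = \sum_{l'=-\infty}^{\infty} \psi_{j}(q'+\eta/\Delta)\, \frac{\omega(x_{l'}-x)}{\lambda_{q'+\eta/\Delta}}\, f(x_{l'}) , \qquad (29)$$

which is the desired variational expression for the expansion coefficients. Finally, from
Eqs. (10), (13) and (19) we have that

$$f_{DAF}(x) = \frac{1}{\sqrt{N}} \sum_{j=-(N-1)/2}^{(N-1)/2} b_{j}(x)
= \frac{1}{N} \sum_{j=-(N-1)/2}^{(N-1)/2} \sum_{l'=-\infty}^{\infty} e^{2\pi i (q'+\eta/\Delta) j/N}\, \frac{\omega(x_{l'}-x)}{\lambda_{q'+\eta/\Delta}}\, f(x_{l'}) , \qquad (30)$$

which is the formal DAF expression without approximation.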

If $f(x)$ is periodic with period $N\Delta$, then $f(x_{l'})$ depends on only $q'$ (i.e., not on $p'$)
and the final result of Eq. (30) reduces to

$$f_{DAF}(x) = \frac{1}{N} \sum_{j=-(N-1)/2}^{(N-1)/2} \sum_{q'=0}^{N-1} e^{2\pi i (q'+\eta/\Delta) j/N}\, f(q'\Delta+\eta+x) , \qquad (31)$$

where we have made use of Eqs. (21) and (23). (Recall that $x+\eta$ is the grid point closest
to $x$.) This is just the standard Fourier approximation to a periodic function known on
$N$ equally spaced grid points. It is interpolative (i.e., $f_{DAF}(x_{l}) = f(x_{l})$), where $x_{l}$ is any grid
point and hence for which $\eta = 0$. This is, of course, the anticipated result for a
least-squares fit of a periodic function using a Fourier basis.

If $f(x)$ is not periodic then $f_{DAF}(x)$ is nowhere exact (unless accidentally so), and,
in particular, the DAF approximation is not interpolative. The quantity $f(q'\Delta+\eta+x)$ in
Eq. (31) is replaced by

$$\bar{f}_{q'} = \sum_{p'=-\infty}^{\infty} \frac{\omega(x_{l'}-x)}{\lambda_{q'+\eta/\Delta}}\, f(x_{l'}) , \qquad (32)$$

which is a weighted average across the infinite grid of functional values on grid points
separated by multiples of $N\Delta$.

It is clear that the DAF approximation of Eq. (30) (being basically a Fourier sum)
suffers from the principal drawback of the Fourier representation, namely that the
approximation is not tightly banded. That is (off the grid) all of the $N$ values of $\bar{f}_{q'}$
contribute more or less equivalently to the approximation. Said another way, each grid
point contributes through the normalized probability $\omega(x_{l}-x)/\lambda_{q+\eta/\Delta}$, which falls off much
more slowly than $\omega(x_{l}-x)$ itself as $q$ is varied. To introduce a more tightly banded DAF
representation of the function, we now assume that $C(\eta)$ can be effectively
replaced by a matrix that is independent of $\eta$. In so doing we, of course, ignore
variations in $C(\eta)$ over the distance of the grid spacing. There are various ways that this
can be done. In previous studies, where we employed a polynomial basis set rather than
the circular functions of Eq. (19), it proved convenient to replace $C(\eta)$ by its average.
This allowed us to use the properties of orthogonal polynomials to construct the
corresponding approximation to $C^{-1}$. We referred to the resulting representation of the
function as well-tempered because it has the property that for functions where the
approximation is applicable (so-called DAF-class functions) the fit is of comparable
accuracy both on and off the grid. In contrast, in the present case it is convenient to make
an $\eta$-independent approximation to $C(\eta)$ for which the grid points are special.

The idea is that as $N$ becomes large and the grid spacing becomes small in such
a way that $N\Delta$ is held constant, every point becomes effectively a grid point
(assuming continuity of the function to be fit). Then, to controllable accuracy we can
replace $C(\eta)$ by $C(\eta=0)$ to obtain

$$b_{j}(x) \approx \sum_{j'} \sum_{l'=-\infty}^{\infty} (C^{-1}(0))_{jj'}\, \omega(x_{l'}-x)\, B^{*}_{j'}(x_{l'}|x)\, f(x_{l'}) \qquad (34)$$

and

$$f_{DAF}(x) \approx \frac{1}{N\lambda_{0}} \sum_{l'=-\infty}^{\infty} \omega(x_{l'}-x) \sum_{j'=-(N-1)/2}^{(N-1)/2} e^{-2\pi i (x_{l'}-x) j'/L}\, f(x_{l'}) , \qquad (35)$$

where $L = N\Delta$. The applicability of this approximation depends of course on the
appropriate choice of the DAF parameters, which has been discussed elsewhere. The
sum can be written in terms of the $M$th order Dirichlet kernel, $D_{M}(y)$, defined by

$$D_{M}(y) = \frac{1}{\pi}\left[\frac{1}{2} + \sum_{k=1}^{M} \cos(ky)\right] = \frac{\sin[(M+\frac{1}{2})y]}{2\pi \sin(y/2)} , \qquad (36)$$

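which leads to the expression

$$f_{DAF}(x) \approx \frac{2\pi}{N\lambda_{0}} \sum_{l'=-\infty}^{\infty} \omega(x_{l'}-x)\, D_{(N-1)/2}\left(\frac{2\pi(x_{l'}-x)}{L}\right) f(x_{l'}) . \qquad (37)$$

This result is parameterized by three quantities $L$, $N$ (which are related by the grid
spacing $\Delta$) and $\sigma$ (see the form of $\omega$ of Eq. (12)).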
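The closed form of the Dirichlet kernel in Eq. (36) can be checked against the defining cosine sum. The following is a small sketch; the function names are illustrative, and the removable singularity at $y = 2\pi k$ is handled by its limiting value $(2M+1)/2\pi$.

```python
import math

def dirichlet_sum(order, y):
    """M-th order Dirichlet kernel as the cosine sum of Eq. (36)."""
    return (0.5 + sum(math.cos(k * y) for k in range(1, order + 1))) / math.pi

def dirichlet_closed(order, y):
    """Closed form sin((M + 1/2) y) / (2 pi sin(y / 2))."""
    s = math.sin(y / 2.0)
    if abs(s) < 1e-12:
        # Limit at y = 2*pi*k, where numerator and denominator both vanish.
        return (2 * order + 1) / (2.0 * math.pi)
    return math.sin((order + 0.5) * y) / (2.0 * math.pi * s)

max_gap = max(abs(dirichlet_sum(m, y) - dirichlet_closed(m, y))
              for m in range(0, 8)
              for y in (0.0, 0.1, 0.7, 1.9, 3.0, -2.3))
```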

Since our approximation $C(\eta) \approx C(\eta=0)$ is exact on the grid, this approximation
is interpolative for functions that are periodic on a domain of length $L$. If we take the
limit $N \to \infty$ and $L \to \infty$ in such a way that $\Delta = L/N$ is fixed, then the approximation assumes
the sinc-DAF form

$$f_{DAF}(x) = \sum_{l'=-\infty}^{\infty} \omega(x_{l'}-x)\, \frac{\sin(\pi(x_{l'}-x)/\Delta)}{\pi(x_{l'}-x)/\Delta}\, f(x_{l'}) , \qquad (38)$$

where we have used the fact that $\lambda_{0} = 1$ in this limit. This
result is interpolative on all grid points.
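A minimal numerical sketch of a Gaussian-weighted sinc interpolation of this kind follows. It assumes the kernel $\sin(\pi u/\Delta)/(\pi u/\Delta)$ with $u = x_{l'} - x$ and the Gaussian weight of Eq. (12); the function name `sinc_daf`, the test function and the choice $\sigma = 3\Delta$ are assumptions made for illustration only.

```python
import math

def sinc_daf(x, grid, values, delta, sigma):
    """Gaussian-weighted sinc interpolation on a uniform grid of spacing delta."""
    total = 0.0
    for x_l, f_l in zip(grid, values):
        u = x_l - x
        arg = math.pi * u / delta
        sinc = 1.0 if abs(arg) < 1e-12 else math.sin(arg) / arg
        weight = math.exp(-u * u / (2.0 * sigma * sigma))  # Eq. (12)
        total += weight * sinc * f_l
    return total

DELTA = 0.25
grid = [DELTA * i for i in range(-60, 61)]
values = [math.exp(-t * t) for t in grid]
SIGMA = 3.0 * DELTA

on_grid_error = abs(sinc_daf(grid[70], grid, values, DELTA, SIGMA) - values[70])
off_grid_error = abs(sinc_daf(0.1, grid, values, DELTA, SIGMA) - math.exp(-0.01))
```

On a grid point the weight equals one and the sinc factor vanishes at every other grid point, so the sampled value is reproduced; off the grid the accuracy depends on the ratio $\sigma/\Delta$ and on how smooth the function is.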

CONCLUSION 

We have shown that the variational principle used earlier for generating 
noninterpolating DAFs (which could be used to generate associated wavelets) can also 
be used to derive interpolating DAFs, with a Gaussian weight, that were first obtained 
by multiplying various interpolation shells with a Gaussian, which regularized the
function (making it infinitely differentiable) and ensured that it decays rapidly both in
physical and Fourier space. We therefore conclude that the interpolating and 
noninterpolating DAFs are very closely related, corresponding to different ways of 
solving the moving least squares variational algebraic equations. This result complements 
the earlier procedure used to construct the interpolating DAFs and provides another 
framework in which to develop robust approximation and estimation algorithms. Both 
the interpolating and noninterpolating DAFs, of course, have been shown previously to 
be computationally robust [1-2,6-7,11-16].

WAVELETS, DELTA SEQUENCES AND DAFs
INTRODUCTION

The rapid development and great success of wavelet theory and technology in the 
last decade [1-7] have stimulated intense interest among mathematicians, engineers, 
physicists and chemists. New results are regularly reported and applications are found 
in virtually every discipline of science and engineering [8-10]. The basic theory and 
construction procedures are regarded as well understood. However, there are areas where 
the existing wavelet methods encounter difficulties. One such area is computational fluid 
dynamics and, more generally, computational chemistry, physics and mechanics.
Meyer [4] has recently posed the question "Can wavelets play a part in the study or the
understanding of the Navier-Stokes equations?" He concluded that "we still do not know
the answer to this question". Since wavelets are intimately and significantly related to
spline theory and the theory of approximations, it is likely that wavelet theory can lead
to entirely new approaches for scientific and engineering computations, where traditional
methods are either global or local. It is well known that global spectral methods are
accurate and efficient for linear partial differential equations (PDEs), whereas local
methods are simple and convenient for nonlinear PDEs. It is extremely important to
develop an approach that delivers global method accuracy, while also providing local
method flexibility and simplicity, for nonlinear PDEs involving singularities, or
homoclinic orbits, for which obtaining accurate and stable numerical solutions is still a
major challenge [11,12]. Wavelet theory has been intensively studied for this purpose
[13-23]. 

The rapid growth and unprecedented success of wavelets have led most recent 
researchers to focus more on the exploration of new applications rather than reflecting 
on the basic concepts. The predominant view of wavelets has been strongly influenced
by the belief that they are analysis tools for describing a signal efficiently in time and
in frequency simultaneously, thus overcoming the classical limitation of Fourier analysis,
which is strictly efficient either in the time or the frequency domain. As pointed out by
Flandrin and Goncalves [24], wavelets and wavelet transforms are basically time-scale
tools, rather than time-frequency ones. Moreover, as far as time-frequency analysis is
concerned, there are many other approaches which are able to outperform both Fourier
analysis and wavelets in certain cases. There are other views, which are somewhat less
influential and less developed. For example, Holschneider has recognized that a
real-valued, non-negative scaling wavelet provides a smoothed version of a function $f$ over
the real line R by means of the convolution product

*->(/) =< Mb), f >= f°° dt-<f> (^] /(t)5£(6) * /. (1) (1) 

J— oo a, \ CL J 



Then a wavelet transform given by 




where $\psi(x) = (x\partial_{x} + 1)\phi(x)$ and $\psi_{a}(x) = \psi(x/a)/a$, provides a mathematical microscopy of
$f$ at length scale $a$. These are important ideas and they deserve additional study.

In a separate development, Hoffman, Kouri and co-workers [23-29] have
presented a powerful computational method based on distributed approximating
functionals (DAFs), for various numerical applications, including solving linear [30] and
nonlinear [31] partial differential equations (PDEs), signal analysis and the padding of
experimental data. On the real line R, the DAFs are multiparameter delta sequences of
the Dirichlet type, constructed using functions of the Schwartz class. The success of both
wavelets and DAFs in a variety of applications has led to a search for a connection
between them. Indeed, there is a natural connection between the two [32] and numerous
wavelet-DAFs and DAF-wavelets have been discovered [32,33]. Wavelet-DAFs have
been used to obtain results in solving the Navier-Stokes equations [34] with nonperiodic
boundary conditions. In fact, underlying both wavelets and DAFs is a common
mathematical structure, the theory of distributions, which was initiated by physicists and
engineers, and was later presented in a rigorous mathematical form by Schwartz [35],
Korevaar [36] and others. A general distribution analysis of wavelets has been given by
Meyer [1], Meyer and Coifman [4], Daubechies [3] and others [37]. Orthogonal wavelet
expansions of the delta distribution are discussed by Walter [38]. However, the role of
delta sequences in wavelet theory has hardly been addressed. In particular, a large class
of delta sequences can be identified as scale wavelets, and they can be used as
multi-resolution wavelet bases, which we call delta-sequence-generated multi-resolution
wavelet bases (or DAF-wavelets). The purpose of the present work is to provide a
unified description of wavelets, delta sequences and DAFs.

REVIEW OF DELTA SEQUENCES 

The delta distribution or so-called Dirac delta function began with the Heaviside 
calculus and was informally used by physicists and engineers before Sobolev, Schwartz 
[35], Korevaar [36] and others put it into a rigorous mathematical form. There are 
several formal mathematical constructions which have been used in the literature (35,36). 
We shall take the approach which obtained the delta function as a sequence limit (2). 

A sequence of functions d a (x) 6 4>(jc), is a delta sequence on the domain J if 
for each x e J and suitable functions 



where the sequences $\delta_{\alpha}$ are generalizations of Cauchy sequences and are called
fundamental families on J by Korevaar [36]. 

Two types of delta sequences (the positive type and the Dirichlet type) can be 
used as the basis of a useful classification scheme. Both types are discussed in the 
following two subsections. (Moreover, classifying delta sequences according to whether 
they belong to the Schwartz class or non-Schwartz class is also very useful for various
physical and engineering applications.) 

Delta Sequences of the Positive Type 

Let $\{\delta_{\alpha}\}$ be a sequence of functions on $(-\infty, \infty)$ which are integrable over every
bounded interval. A delta sequence $\{\delta_{\alpha}\}$ is of positive type if, for any bounded interval
containing the origin, the integral in the $\alpha \to \alpha_{0}$ limit of the sequence of the functions
equals 1. For bounded intervals excluding the origin, the integral of the sequence of
functions equals zero in this limit. All members of the sequence are positive
semi-definite. To illustrate, we consider first the example of a delta sequence of impulse
functions.

To approximate idealized physical concepts such as the force density of a unit
force at the origin $x = 0$, or a unit impulse at time $x = 0$, one has

$$\delta_{\alpha}(x) = \begin{cases} \alpha & \text{for } 0 < x < 1/\alpha, \quad \alpha = 1, 2, \cdots \\ 0 & \text{otherwise} \end{cases} \qquad (4)$$

as a delta sequence in the limit $\alpha \to \infty$.

A second example is Gauss' delta sequence, given by

(5)

and it arises naturally as a distribution solution, or so-called weak solution. Gauss' delta
sequence has various interesting properties with regard to differentiability, boundedness
and Fourier transforms, and is used to generate the "Mexican hat" wavelet. Various other
wavelets can also be generated from it as described in later sections of this paper.

A third example is Lorentz's delta sequence,

$$\delta_{\alpha}(x) = \frac{1}{\pi}\, \frac{\alpha}{x^{2}+\alpha^{2}} \quad \text{for } \alpha \to 0 , \qquad (6)$$

which is used in representing the solution of Laplace's equation in the upper half plane.
It is also commonly seen in integral equations involving the Green's function of the
kinetic energy operator (in the momentum representation).

Other examples include Landau's, Poisson's and Fejér's delta sequences.

A systematic procedure for generating various delta sequences of the positive type
is as follows:

$$\rho_{\alpha}(x) = \frac{1}{\alpha}\, \rho\left(\frac{x}{\alpha}\right) \quad (\alpha > 0) . \qquad (7)$$
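The scaling construction of Eq. (7) can be tried out numerically. The sketch below assumes that $\rho$ is non-negative and integrates to one, and uses two illustrative choices (a unit Gaussian and a unit Lorentzian); the helper names `scaled` and `smear` are hypothetical. It evaluates $\int \rho_{\alpha}(x) f(x)\,dx$ by the trapezoid rule and compares the result with $f(0)$ as $\alpha$ shrinks.

```python
import math

def gauss(x):
    """Unit-area Gaussian generator (illustrative choice of rho)."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def lorentz(x):
    """Unit-area Lorentzian generator (illustrative choice of rho)."""
    return 1.0 / (math.pi * (1.0 + x * x))

def scaled(rho, alpha):
    """rho_alpha(x) = rho(x / alpha) / alpha, as in Eq. (7)."""
    return lambda x: rho(x / alpha) / alpha

def smear(delta_fn, f, lo=-50.0, hi=50.0, steps=200000):
    """Trapezoid-rule estimate of the integral of delta_fn(x) * f(x)."""
    h = (hi - lo) / steps
    total = 0.5 * (delta_fn(lo) * f(lo) + delta_fn(hi) * f(hi))
    for i in range(1, steps):
        x = lo + i * h
        total += delta_fn(x) * f(x)
    return total * h

# With f = cos and f(0) = 1, the error should shrink as alpha -> 0.
lorentz_errors = [abs(smear(scaled(lorentz, a), math.cos) - 1.0)
                  for a in (1.0, 0.1, 0.01)]
gauss_value = smear(scaled(gauss, 0.1), math.cos)
```

For the Gaussian generator the integral against $\cos x$ is known in closed form, $e^{-\alpha^{2}/2}$, which gives an independent check of the quadrature.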



Delta Sequences of the Dirichlet Type 

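Definition 4. Let $\{\delta_{\alpha}\}$ be a sequence of functions on $(-\infty, \infty)$ which are integrable over every
bounded interval. We call $\{\delta_{\alpha}\}$ a delta sequence of the Dirichlet type if

1. $\int_{-a}^{a} \delta_{\alpha} \to 1$ as $\alpha \to \alpha_{0}$ for some finite constant $a$.

2. For every constant $\gamma > 0$, $\left(\int_{-\infty}^{-\gamma} + \int_{\gamma}^{\infty}\right) \delta_{\alpha} \to 0$ as $\alpha \to \alpha_{0}$.

3. There are positive constants $C_{1}$ and $C_{2}$ such that

$$|\delta_{\alpha}(x)| \le \frac{C_{1}}{|x|} + C_{2}$$

for all $x$ and $\alpha$.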

Example 1. Dirichlet's delta sequence

An important example of a delta sequence of the Dirichlet type is Dirichlet's delta
sequence

$$\delta_{\alpha}(x) = \begin{cases} D_{\alpha}(x) & \text{for } |x| \le \pi \\ 0 & \text{otherwise} \end{cases} \quad \text{for } \alpha = 0, 1, 2, \ldots \qquad (17)$$

where $D_{\alpha}$ is the Dirichlet kernel given by Eq. (12). Dirichlet's delta sequence plays an
important role in approximation theory and is the key element in trigonometric
polynomial approximations.

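Example 2. Modified Dirichlet's delta sequence
Sometimes there is a slight advantage in taking the last term in $D_{\alpha}$ with a factor of 1/2:

$$D^{*}_{\alpha}(x) = D_{\alpha}(x) - \frac{1}{2\pi}\cos(\alpha x) = \frac{\sin(\alpha x)}{2\pi \tan(\frac{1}{2}x)} , \quad \alpha = 0, 1, 2, \ldots \qquad (18)$$

This is the so-called modified Dirichlet kernel. The difference $D_{\alpha} - D^{*}_{\alpha}$ tends uniformly
to zero on $(-\pi, \pi)$ as $\alpha \to \infty$. They are equivalent with respect to convergence. The
expression given by

$$\delta_{\alpha}(x) = \begin{cases} D^{*}_{\alpha}(x) & \text{for } |x| \le \pi \\ 0 & \text{otherwise} \end{cases} \quad \text{for } \alpha = 0, 1, 2, \ldots \qquad (19)$$

is a delta sequence of Dirichlet type as $\alpha \to \infty$.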

Example 3. Dirichlet's continuous delta sequence
Dirichlet's continuous delta sequence is given by the following Fourier transform of the
characteristic function $\chi[-\alpha, \alpha]$:

$$\delta_{\alpha}(x) = \frac{1}{2\pi} \int_{-\alpha}^{\alpha} e^{-ix\xi}\, d\xi = \frac{\sin(\alpha x)}{\pi x} . \qquad (20)$$

This converges to the delta distribution as $\alpha \to \infty$. Eq. (20) is related to Shannon's
sampling theorem in information theory.

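Example 4. The de la Vallée Poussin delta sequence
The de la Vallée Poussin delta sequence is given by the Fourier inverse of the following
function

$$\hat{\delta}_{\alpha}(\xi) = \begin{cases} 1 & |\xi| \le \alpha \\ 2 - |\xi|/\alpha & \alpha \le |\xi| \le 2\alpha \\ 0 & \text{otherwise} \end{cases} \qquad (21)$$

It is easy to show that

$$\delta_{\alpha}(x) = \frac{1}{\pi}\, \frac{\cos(\alpha x) - \cos(2\alpha x)}{\alpha x^{2}} \qquad (22)$$

is a delta sequence of the Dirichlet type as $\alpha \to \infty$.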
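The closed form of Eq. (22) can be compared with a direct numerical inversion of a trapezoidal spectrum. The sketch below assumes the spectrum equals 1 for $|\xi| \le \alpha$, falls linearly to 0 at $|\xi| = 2\alpha$, and that the inverse transform carries the factor $1/2\pi$; the function names are illustrative.

```python
import math

def dlvp_hat(xi, alpha):
    """Assumed trapezoidal spectrum: 1 on [-alpha, alpha], linear to 0 at 2*alpha."""
    a = abs(xi)
    if a <= alpha:
        return 1.0
    if a <= 2.0 * alpha:
        return 2.0 - a / alpha
    return 0.0

def dlvp_numeric(x, alpha, steps=20000):
    """(1/pi) * integral_0^{2 alpha} hat(xi) cos(x xi) d xi by the midpoint rule."""
    h = 2.0 * alpha / steps
    total = 0.0
    for i in range(steps):
        xi = (i + 0.5) * h
        total += dlvp_hat(xi, alpha) * math.cos(x * xi)
    return total * h / math.pi

def dlvp_closed(x, alpha):
    """Closed form (cos(alpha x) - cos(2 alpha x)) / (pi alpha x^2)."""
    return (math.cos(alpha * x) - math.cos(2.0 * alpha * x)) / (math.pi * alpha * x * x)

gap = max(abs(dlvp_numeric(x, 3.0) - dlvp_closed(x, 3.0))
          for x in (0.2, 1.7, -2.5))
```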

Example 5. Interpolative delta sequence 
Let $\{\delta_{n}\}$ be a sequence and let $\{x_{j}\}_{j=0}^{n}$ be the $n + 1$ zeros of a Jacobi polynomial in $(a, b)$.

^■■ (■-rtS^-.,) ^--* - ew) 1235 

> 

is a delta sequence as $n \to \infty$. This follows from the fact that $\int \delta_{n}(x,y) f(y)\, dy$ are
approximations to the Lagrange interpolating formulae which converge as $n \to \infty$ and sup



Example 6. Delta sequences constructed by orthogonal basis expansions
Let $\{\psi_{n}\}$ be a complete orthonormal $L^{2}(a,b)$ basis. Then

$$\delta_{n}(x,y) = \sum_{i=0}^{n} \psi_{i}(x)\, \psi_{i}(y), \quad x, y \in (a,b) \qquad (24)$$

are delta sequences. In the case of trigonometric functions, we again obtain the Dirichlet
kernel delta sequences given in Examples 1 and 3. A Hermite function expansion is
given by

where $H_{2k}(x/\sqrt{2})$ is the usual Hermite polynomial. This delta sequence was studied by
Schwartz [44], Korevaar [36] and was independently rediscovered by Hoffman, Kouri
and coworkers [25] in a more general form. Various other cases can be found in Walter
and Blum's reference [39] and these also have been studied in very general forms by
Hoffman, Kouri and coworkers [28].

WAVELETS AND DELTA SEQUENCES 

Wavelets have been widely used as an analysis tool for various applications. The 
essential reason for this is because both orthogonal and nonorthogonal wavelets can 
provide a decomposition of a function at a variety of different scales. In other words, 
wavelets form special L 2 (R) bases or frames for representing a function at various levels 
of detail, leading to so-called mathematical microscopies. This turns out to be very 
efficient for approximating and analyzing functions in many applications. Orthogonal 
wavelets and multiresolution analysis have been successfully used in a variety of 
telecommunication and engineering fields [10]. They play a special role in those 
applications where orthogonality is strongly required. In many other applications, 
nonorthogonal wavelets, or frames, are also very useful. Generating new types of 
wavelets has been of great importance in wavelet theory. Most of the delta sequences
described in Section II can be regarded as scaling or father wavelets. These father
wavelets can also be systematically transformed into mother wavelets. The orthogonal
wavelets are briefly reviewed in the first subsection. The connection between delta
sequences and wavelets is made in the second subsection. The construction of mother
wavelets from various delta sequences is discussed in the last two subsections.

Orthogonal Wavelets 

The formal theory of orthogonal wavelets on $L^{2}(R)$ has been presented in many
books [1-3]. An orthogonal wavelet system is usually generated by a single function,
either a father wavelet $\phi$ or a mother wavelet $\psi$, by a standard translation and dilation
technique

$$\phi_{mn}(x) = 2^{-m/2}\, \phi\left(\frac{x}{2^{m}} - n\right), \quad m, n \in Z; \qquad (26)$$

$$\psi_{mn}(x) = 2^{-m/2}\, \psi\left(\frac{x}{2^{m}} - n\right), \quad m, n \in Z, \qquad (27)$$

where the symbol Z denotes the set of all integers. This can be formulated rigorously in
terms of a multiresolution analysis, i.e., a nested sequence of subspaces $\{V_{m}\}$, $m \in Z$, such
that

1. $\{\phi(x-n)\}$ is an orthogonal basis of $V_{0}$;

2. $\cdots \subset V_{1} \subset V_{0} \subset V_{-1} \subset \cdots \subset L^{2}(R)$;

3. $f(x) \in V_{m} \Leftrightarrow f(2x) \in V_{m-1}$;

4. $\bigcap_{m} V_{m} = \{0\}$ and $\overline{\bigcup_{m} V_{m}} = L^{2}(R)$.

Since $\phi \in V_{0} \subset V_{-1}$, it can be expressed as a superposition of $\{\phi_{-1,n}\}$, which constitute
a basis for $V_{-1}$,

$$\phi(x) = \sum_{n} a_{n}\, \phi_{-1,n}(x), \qquad (28)$$

where $\{a_{n}\}$ is a set of finite coefficients.

For an orthogonal system, the subspace $V_{-1}$ can be further decomposed into its
orthogonal projection in $V_{0}$ and a complement $W_{0}$

$$V_{-1} = V_{0} \oplus W_{0} , \qquad (29)$$

where $W_{0}$ is a subspace spanned by orthogonal mother wavelets $\{\psi_{0,n}\}$. In general, $\psi_{m,n}$,
$n \in Z$, is an orthogonal basis of $W_{m}$ and

$$\bigoplus_{m} W_{m} = L^{2}(R). \qquad (30)$$

It follows that $\psi_{m,n}$ ($m, n \in Z$) is an orthogonal basis of $L^{2}(R)$. Similarly to Eq. (28),
the mother wavelet can also be expressed as a superposition of $\{\phi_{-1,n}\}$

$$\psi(x) = \sum_{n} b_{n}\, \phi_{-1,n}(x), \qquad (31)$$

where $b_{n} = (-1)^{n} a_{1-n}$.

Perhaps the simplest example is Haar's wavelet system [2], which is given by
$\phi(x) = \chi_{[0,1]}(x)$, the characteristic function of the interval [0,1]. It obviously has
orthogonal translations. The dilation of $\phi(x)$ results in characteristic functions for
smaller (or larger) intervals and each of them spans a subspace $V_{m}$ by translations.
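The orthogonality properties of the Haar system can be verified with a few lines of code. The sketch below assumes the standard Haar mother wavelet $\psi(x) = \phi(2x) - \phi(2x-1)$, which is not written out in the text above, and uses a half-open interval for the characteristic function so that translates do not overlap at a point; the helper names are illustrative.

```python
import math

def haar_father(x):
    """Characteristic function of [0, 1) (half-open to avoid double counting)."""
    return 1.0 if 0.0 <= x < 1.0 else 0.0

def haar_mother(x):
    """Standard Haar mother wavelet, assumed here: phi(2x) - phi(2x - 1)."""
    return haar_father(2.0 * x) - haar_father(2.0 * x - 1.0)

def inner(f, g, lo=-4.0, hi=4.0, steps=8000):
    """Midpoint-rule inner product; cell edges align with the jump points."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) * g(lo + (i + 0.5) * h)
                   for i in range(steps))

norm_father = inner(haar_father, haar_father)
shift_overlap = inner(haar_father, lambda x: haar_father(x - 1.0))
father_mother = inner(haar_father, haar_mother)
norm_mother = inner(haar_mother, haar_mother)
# Mother wavelet against its dilate sqrt(2) * psi(2x), one scale finer.
cross_scale = inner(haar_mother, lambda x: math.sqrt(2.0) * haar_mother(2.0 * x))
```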

It is not obvious that a multiresolution analysis exists for $\phi$ other than the
Haar system. The construction of the first few orthogonal wavelet bases was more or
less an art rather than a systematic procedure; it required ingenuity, special tricks and
subtle computations. One procedure, due to Meyer [1], is to begin with a spline
function $\theta(x) = (1-|x-1|)\chi_{[0,2]}(x)$ which, by translations, generates a nonorthogonal Riesz
basis (a frame of the least possible redundancy). The corresponding orthogonal basis

{ (p} is represented in Fourier space as [add from text] 

where $a(\xi)$ is a function to be determined. Using both the orthonormality requirement,

$$\delta_{0,n} = \langle \phi_{0,0}, \phi_{0,n} \rangle , \qquad (32)$$

and the periodicity, $\hat{\phi}$ can be resolved as



^0 = -^ry 3 v 

Daubechies presents another approach for constructing orthogonal wavelets.
In the Fourier representation, the dilation equation can be written as

$$\hat{\phi}(\xi) = m_{0}(\xi/2)\, \hat{\phi}(\xi/2), \qquad (34)$$

where $m_{0}$ is a $2\pi$-periodic function. The orthonormality condition then requires

$$|m_{0}(\xi/2)|^{2} + |m_{0}(\xi/2+\pi)|^{2} = 1. \qquad (35)$$

It turns out that if the set of expansion coefficients $a_{n}$ of Eq. (28) are chosen as
$a_{0} = \nu(\nu-1)/[(\nu^{2}+1)\sqrt{2}]$, $a_{1} = -(\nu-1)/[(\nu^{2}+1)\sqrt{2}]$, $a_{2} = (\nu+1)/[(\nu^{2}+1)\sqrt{2}]$, $a_{3} = \nu(\nu+1)/[(\nu^{2}+1)\sqrt{2}]$ $(\nu \in R)$,

then Eq. (35) will be satisfied and consequently $\phi$ can be found recursively.
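The orthonormality condition of Eq. (35) can be checked numerically for a one-parameter family of four coefficients. The sketch below assumes the coefficients take the form $a_{0} = \nu(\nu-1)/d$, $a_{1} = (1-\nu)/d$, $a_{2} = (\nu+1)/d$, $a_{3} = \nu(\nu+1)/d$ with $d = (\nu^{2}+1)\sqrt{2}$, and that $m_{0}(\xi) = 2^{-1/2}\sum_{n} a_{n} e^{-in\xi}$; both the normalization and the function names are assumptions made for illustration.

```python
import cmath
import math

def four_tap(nu):
    """Assumed one-parameter family of four scaling coefficients."""
    d = (nu * nu + 1.0) * math.sqrt(2.0)
    return [nu * (nu - 1.0) / d, (1.0 - nu) / d, (nu + 1.0) / d, nu * (nu + 1.0) / d]

def m0(xi, coeffs):
    """m_0(xi) = (1/sqrt 2) * sum_n a_n exp(-i n xi) (assumed normalization)."""
    return sum(a * cmath.exp(-1j * n * xi) for n, a in enumerate(coeffs)) / math.sqrt(2.0)

def orthonormality_defect(nu, xi):
    """Deviation of |m0(xi)|^2 + |m0(xi + pi)|^2 from 1, cf. Eq. (35)."""
    c = four_tap(nu)
    return abs(abs(m0(xi, c)) ** 2 + abs(m0(xi + math.pi, c)) ** 2 - 1.0)

worst = max(orthonormality_defect(nu, xi)
            for nu in (-2.0, -1.0 / math.sqrt(3.0), 0.3, 1.5)
            for xi in (0.0, 0.4, 1.3, 2.9))
d4 = four_tap(-1.0 / math.sqrt(3.0))
```

Under these assumptions the choice $\nu = -1/\sqrt{3}$ gives the familiar four-coefficient filter with leading coefficient $(1+\sqrt{3})/(4\sqrt{2})$.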

Delta Sequences as Father Wavelets 

Let $\{\delta_{\alpha}\}$, where $\alpha \to \alpha_{0}$, be a sequence of $C^{m}$ functions on $(-\infty, \infty)$ which are
integrable over every bounded interval and

1. $\hat{\delta}_{\alpha}(0) = 1$ for each $\alpha$;

2. $\lim_{\alpha \to \alpha_{0}} \hat{\delta}_{\alpha}(\xi) \to 1$ for all $\xi$;

3. for every constant $\gamma > 0$, $\left(\int_{-\infty}^{-\gamma} + \int_{\gamma}^{\infty}\right) \delta_{\alpha} \to 0$ as $\alpha \to \alpha_{0}$;

4. and $\| x\delta_{\alpha}(x) \|_{\infty} < \infty$ for all $x$ and $\alpha$.

Then the $\{\delta_{\alpha}\}$ are delta sequences and each function can be admitted as a father wavelet.
We call this class of father wavelets "delta-sequence-generated father wavelets $\phi_{\alpha}$". The
corresponding "delta-sequence-generated mother wavelets $\psi_{\alpha}$" have the Fourier
transform property

$$\hat{\psi}_{\alpha}(0) = \int \psi_{\alpha}(x)\, dx = 0. \qquad (36)$$

It is natural to view delta sequences as father wavelets. In particular, if the delta
sequence has the structure that $\delta_{\alpha}(x) = (1/\alpha)\rho(x/\alpha)$ and $\int \rho(x)\, dx = 1$ (as is the case for many
examples given in Section II), then $\delta_{\alpha}$ is a sequence of father wavelets at different scales.
In contrast to the delta distribution, which has only a point support, a function in a delta
sequence can have an arbitrary support, depending on the scale. In the limit $\alpha \to \alpha_{0}$, the
delta sequence converges to the delta distribution and the support shrinks down to a
point. The resulting delta distribution actually helps to furnish the wavelet
multiresolution analysis [37]



$$\{\delta\} \oplus \bigoplus_{m} W_{m} = L^{2}(R), \qquad (37)$$

where $\{\delta\}$ is the space spanned by the delta distribution. This is in contrast to Eq. (30).
Clearly, if a delta sequence is an orthogonal system, such as Dirichlet's continuous delta
sequence, for a fixed $\alpha$ we have

{*.}©0Wm = {f}©©W'm = (38) 
— oo m&Z 

where $\{\delta_{\alpha}\}$ spans the wavelet subspace $V_{0}$. Hence the orthogonal delta sequence spans
the wavelet subspace $W_{m}$ for an appropriate choice of $\alpha$.

Delta sequence generated mother wavelets can be constructed in many different
ways. We discuss two approaches in the next two subsections.

Wavelets Generated by Differentiation Pairs

For a given $C^{m}$ father wavelet $\phi$, we define a family of wavelet generators

$$G^{(n)} = x\frac{d^{n}}{dx^{n}} + n\frac{d^{n-1}}{dx^{n-1}}, \quad n = 0, 1, 2, \ldots, m \qquad (39)$$

for generating a family of $m + 1$ mother wavelets

$$\psi_{\alpha,n}(x) := G^{(n)}\phi_{\alpha}(x) \quad \text{for } \phi \in C^{m} \text{ and } n = 0, 1, 2, \cdots, m. \qquad (40)$$

It is noted that this approach is not restricted to the delta-sequence-generated wavelets and
is actually a very general and efficient way for creating wavelets from a given $C^{1}$ father
wavelet. The transform prescribed by Holschneider [6], Eq. (2), is a special case of our
family of wavelet generators.
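The action of the generators of Eq. (39) can be sketched numerically. The example below assumes the operator $x\,d^{n}/dx^{n} + n\,d^{n-1}/dx^{n-1}$, approximates derivatives by central finite differences, and applies it to a unit-variance Gaussian (an illustrative choice of father wavelet); the function names are hypothetical.

```python
import math

def gauss(x):
    """Unit-variance Gaussian used as an illustrative father wavelet."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def derivative(f, x, order, h=1e-2):
    """Central finite-difference estimate of the order-th derivative of f at x."""
    if order == 0:
        return f(x)
    total = 0.0
    for k in range(order + 1):
        total += (-1) ** k * math.comb(order, k) * f(x + (order / 2.0 - k) * h)
    return total / h ** order

def generator(n, phi):
    """Return x -> x * phi^(n)(x) + n * phi^(n-1)(x), cf. Eq. (39)."""
    def psi(x):
        value = x * derivative(phi, x, n)
        if n > 0:
            value += n * derivative(phi, x, n - 1)
        return value
    return psi

def integrate(f, lo, hi, steps=4000):
    """Trapezoid rule."""
    h = (hi - lo) / steps
    return h * (0.5 * f(lo) + sum(f(lo + i * h) for i in range(1, steps)) + 0.5 * f(hi))

mexican_hat = generator(1, gauss)   # expected shape: (1 - x^2) * gauss(x)
superhat = generator(3, gauss)      # expected shape: -(x^4 - 6 x^2 + 3) * gauss(x)
hat_mean = integrate(mexican_hat, -10.0, 10.0)
```

Since $x\phi^{(n)} + n\phi^{(n-1)}$ equals the $n$th derivative of $x\phi(x)$, every generated function with $n \ge 1$ integrates to zero whenever $x\phi(x)$ and its derivatives vanish at infinity.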

Our wavelet generators are closely related to the transformation Lie group of
translations and dilations. This is because the Fourier images of distributions



form an infinite C* dimensional wavelet Lie algebra with elements {A r ;t =^ , * / 4|n=0,l,2, 
. . .} (here we follow the convention that statements concerning the structure of a Lie
algebra are made only on the basis of the real Lie algebra). The whole Lie algebra
structure of $\{X_{n}\}$ is simply given by

$$[X_{n}, X_{m}] = (m-n)X_{m+n-2}, \quad n, m = 0, 1, 2, \cdots . \qquad (42)$$

$X_{1}$ generates a one-parameter non-compact abelian group which is obviously the
translation group in momentum space. $X_{2}$ generates a dilation group. There are two
nontrivial invariant subalgebras

$$[X_{1}, X_{2}] = X_{1} \qquad (43)$$

and

$$[X_{1}, X_{2}] = X_{1}, \quad [X_{1}, X_{3}] = 2X_{2}, \quad [X_{2}, X_{3}] = X_{3}. \qquad (44)$$

$X_{1}$, $X_{2}$ are respectively the infinitesimal generators of a two-dimensional translation and
dilation group. The third element $X_{3}$ is a quadratic dilation (superdilation), which allows
us to generate another invariant subalgebra, Eq. (44). This result indicates that our
method of systematically generating wavelets is very general and has a mathematically
rigorous foundation. More details will be presented elsewhere [45].
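The commutation relations of Eq. (42) can be checked with a small symbolic sketch. It assumes, as one possible realization that is not stated in the text above, the vector fields $X_{n} = \xi^{n-1}\, d/d\xi$, for which the bracket of $f\,d/d\xi$ and $g\,d/d\xi$ is $(f g' - g f')\, d/d\xi$. Coefficient functions are stored as dictionaries mapping powers of $\xi$ to integer coefficients; all names are illustrative.

```python
def clean(poly):
    """Drop zero coefficients so that equal fields compare equal."""
    return {power: coef for power, coef in poly.items() if coef != 0}

def X(n):
    """Coefficient function xi**(n-1) of the assumed field X_n = xi**(n-1) d/dxi."""
    return {n - 1: 1}

def diff(poly):
    """Derivative with respect to xi (negative powers are allowed)."""
    return clean({power - 1: power * coef for power, coef in poly.items()})

def mul(p, q):
    out = {}
    for i, a in p.items():
        for j, b in q.items():
            out[i + j] = out.get(i + j, 0) + a * b
    return clean(out)

def sub(p, q):
    out = dict(p)
    for power, coef in q.items():
        out[power] = out.get(power, 0) - coef
    return clean(out)

def scale(factor, poly):
    return clean({power: factor * coef for power, coef in poly.items()})

def bracket(f, g):
    """[f d/dxi, g d/dxi] = (f g' - g f') d/dxi."""
    return sub(mul(f, diff(g)), mul(g, diff(f)))

relation_holds = all(bracket(X(n), X(m)) == scale(m - n, X(m + n - 2))
                     for n in range(6) for m in range(6))
```

In this realization $X_{1} = d/d\xi$ generates translations in $\xi$ and $X_{2} = \xi\, d/d\xi$ generates dilations, in line with the roles described above.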

Example 1. Mexican hat wavelet and generalizations

If we take Gauss's delta sequence as a father wavelet ^^^"izv^X £ 
then Eq. (40), for n = 1, is 



which is the well-known Mexican hat wavelet [3]. Taking n = 3 yields 



7TO£ 



which is an interesting "Mexican superhat wavelet". This wavelet is expected to perform
better than the Mexican hat for some applications. Since elements of Gauss' delta
sequence are $C^{\infty}$ functions, there are infinitely many Gauss-delta-sequence-generated
wavelets given by

$n = 0, 1, 2, \ldots$ (47)



It is seen that the celebrated Mexican hat wavelet [3] is just a special case of Eq. (47). 

It is interesting to note that all higher order Hermite functions ($n \ne 0$) are mother
wavelets, while the lowest order Hermite function is a father wavelet. This can be
naturally seen from the orthonormality condition



Here, if the second polynomial is fixed as $m = 0$ to a given constant, then the only case
for $H_{n}$ to give a nonzero integral is $n = 0$, which determines a father wavelet. All other
$H_{n}$ ($n \ne 0$) give rise to mother wavelets. We point out that this result is not limited to
Hermite polynomials. It is generally true for all polynomial systems which form
orthogonal $L^{2}(a,b)$ bases with respect to an appropriate weight. More details about the
general connection between wavelets and conventional Hilbert space bases will be given
elsewhere [46].

Example 2. Shannon's wavelet family

Dirichlet's continuous delta sequence is related to the well-known Shannon's
father wavelet $\phi(x) = \sin(\pi x)/(\pi x)$. The latter is known for generating an orthogonal
basis for a reproducing kernel Hilbert space. A family of (mother) wavelets can be
generated by using our wavelet generators, Eq. (39):

$$\psi_{\alpha,n}(x) = G^{(n)}\, \frac{\sin(\alpha x)}{\pi x} \qquad (48)$$

$$= \begin{cases}
\dfrac{1}{\pi}\sin(\alpha x) & \text{for } n = 0 \\
\dfrac{\alpha}{\pi}\cos(\alpha x) & \text{for } n = 1 \\
\quad \vdots \\
\dfrac{(-1)^{q}\alpha^{2q}}{\pi}\sin(\alpha x) & \text{for } n = 2q \\
\dfrac{(-1)^{q}\alpha^{2q+1}}{\pi}\cos(\alpha x) & \text{for } n = 2q+1.
\end{cases} \qquad (49)$$

These results are in contrast to Shannon's wavelet, $\psi(x) = [\sin(2\pi x) - \sin(\pi x)]/(\pi x)$.
Obviously, all of these wavelets can be used to generate orthogonal wavelet bases by the
standard method of translations and dilations. It follows that if the starting father wavelet
generates an orthogonal system, then the corresponding wavelets created by our wavelet
generators, Eq. (39), are also orthogonal systems.

It is very easy to construct various delta-sequence-generated wavelets by applying 
examples given in Section II to the right hand side of Eq. (40). 
Wavelets Generated by Difference Pairs 

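Another simple and efficient way of generating wavelets from delta sequences is 
to take the difference between two normalized elements in a delta sequence 

\[ \psi_{\alpha,\alpha'}(x) = \delta_{\alpha}(x) - \delta_{\alpha'}(x), \qquad \alpha \neq \alpha'. \tag{50} \]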

Example 1. Hermite wavelets and the Mexican hat wavelet 
In the case of Hermite's delta sequence, Eq. (25), we have 

\[ \psi_{n,n'}(x) = \delta_n(x) - \delta_{n'}(x) \tag{51} \]
\[ = \exp\left(-\frac{x^2}{2}\right) \sum_{k=n'+1}^{n} \left(-\frac{1}{4}\right)^{k} \frac{1}{\sqrt{2\pi}\, k!}\, H_{2k}\!\left(\frac{x}{\sqrt{2}}\right). \tag{52} \]

This is a general expression for a family of nonorthogonal wavelets. In particular, if 
n = 1 and n' = 0, we obtain 

\[ \psi_{1,0}(x) = \frac{1}{2\sqrt{2\pi}}\,(1 - x^2)\, e^{-x^2/2}. \tag{53} \]

This is, once again, the well-known Mexican hat wavelet [3]. The Hermite wavelets 
described in Eq. (47) can easily be obtained, within a constant difference, by appropriately 
choosing n' = n - 1 in Eq. (51). 

Example 2. Shannon's wavelet 

We can use Dirichlet's continuous delta sequence as a father wavelet, 
$\delta_\alpha(x) = \sin(\alpha x)/(\pi x)$. Then the corresponding mother wavelets generated by 
Eq. (50) are 

\[ \psi_{\alpha,\beta}(x) = \frac{\sin(\alpha x)}{\pi x} - \frac{\sin(\beta x)}{\pi x} \qquad \text{for } \alpha \neq \beta. \tag{54} \]

This family includes the well-known Shannon's wavelet [3] as a special case 

\[ \psi_{2\pi,\pi}(x) = \frac{1}{\pi x}\left[\sin(2\pi x) - \sin(\pi x)\right]. \tag{55} \]

It is easy to check that this Shannon's wavelet generates an orthogonal system. Equation 
(55) is in contrast to the other family of Shannon's wavelets, Eq. (49), produced using 
our wavelet generators. 
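
The difference-pair construction can be checked numerically. The following is a minimal sketch, not the authors' code: it assumes a normalized Gaussian form for the elements of Gauss' delta sequence and the Mexican hat normalization $(1 - x^2)e^{-x^2/2}/(2\sqrt{2\pi})$; the helper names (`gauss_delta`, `difference_wavelet`, `trapezoid`) are hypothetical. It verifies that a difference of two normalized delta-sequence elements has zero mean, which is what distinguishes a mother wavelet from a father wavelet.

```python
import math

def gauss_delta(x, sigma):
    """Assumed normalized Gaussian element of a Gauss delta sequence."""
    return math.exp(-x * x / (2.0 * sigma * sigma)) / (math.sqrt(2.0 * math.pi) * sigma)

def difference_wavelet(delta, a, b):
    """Return psi(x) = delta(x, a) - delta(x, b) for two distinct parameters."""
    return lambda x: delta(x, a) - delta(x, b)

def mexican_hat(x):
    """Mexican hat wavelet with the normalization assumed in the lead-in."""
    return (1.0 - x * x) * math.exp(-x * x / 2.0) / (2.0 * math.sqrt(2.0 * math.pi))

def trapezoid(f, lo, hi, n):
    """Composite trapezoid rule with n sub-intervals."""
    h = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, n):
        total += f(lo + i * h)
    return total * h

# Difference of two normalized Gaussians (widths 1 and 2 are illustrative).
dog = difference_wavelet(gauss_delta, 1.0, 2.0)
mean_dog = trapezoid(dog, -40.0, 40.0, 8000)
mean_hat = trapezoid(mexican_hat, -40.0, 40.0, 8000)
print(mean_dog, mean_hat)  # both should be zero to within rounding error
```

Each element integrates to one, so any difference of two elements integrates to zero; the numerical check only confirms this for the two assumed forms.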

Example 3. Gauss' wavelets 

It is noted that this procedure of generating wavelets is also very general. For 
example, a wavelet can be constructed by combining a pair of functions from the Gauss' 
delta sequence, 

\[ \psi_{\alpha,\beta}(x) = \delta_\alpha(x) - \delta_\beta(x) \qquad \text{for } \alpha \neq \beta, \tag{56} \]

where $\delta_\alpha$ and $\delta_\beta$ are elements of Gauss' delta sequence. Note that this is not a 
special case of Example 1. 

In the case where there is more than one delta parameter, the corresponding 
wavelets can be generated as differences of cross terms. This will be discussed 
in detail in Section V. 
DISTRIBUTED APPROXIMATING FUNCTIONALS 

Definition 5. A function f is said to be of the Schwartz class if it is a $C^\infty$ function 
and rapidly decaying, 

\[ \sup_{x} \left| x^{i}\, \partial_x^{j} f(x) \right| < \infty, \tag{57} \]

for all pairs of i and j. The set of all functions of the Schwartz class is denoted 
by S. 

Definition 6. Distributed Approximating Functionals (DAFs) are families of 
functions which satisfy the following: 

1. They are sequences of Schwartz class functions on the real line R; 

2. They are multiple parameter sequences and each parameter independently 
leads to a sequence of functions which converges to the delta distribution 
$\delta$; 

3. They are delta sequences of the Dirichlet type. 

Definition 7. A function on $(-\infty, \infty)$ is said to be of the DAF class if it is 
integrable over every bounded interval and is of at most polynomial growth 
asymptotically. 

Fourier transforms of Schwartz class functions are still Schwartz class functions. 
DAFs are smooth and rapidly decaying functions in both Fourier space and ordinary 
space. We note that these properties are crucial to the usefulness of DAFs in various 
numerical applications. DAFs are a multi-parameter system. The best computational 
efficiency can only be obtained in certain regions of the parameter space. For Hermite 
DAFs, we call these well-tempered regions. (Note that well-tempered used here has 
nothing to do with the tempered distributions which arise naturally in the theory of 
Fourier transforms.) By well-tempered, we mean that the DAF approximations to a DAF 
class function are of the same level of accuracy both on and off a grid, or that DAF 
approximations of the derivatives of the function have compatible accuracy to that of the 
function. By compatible accuracy, we mean that the accuracy decreases less than a factor 
of two as the order of differentiation increases by one. (Note that pointwise 
differentiability is not required for the DAF class functions because generalized 
derivatives exist for the DAF convolution of a DAF class function. Certainly generalized 
derivatives are, in general, not necessarily functions but are distributions. That is how 
DAFs can lead to distribution solutions to partial differential equations, and why they 
turn out to be extremely powerful for approximating functions which have discontinuities 
and even singularities on a set of measure zero.) An alternative way of understanding the 
well-tempered region of DAF parameter space is to view it from the wavelet 
multiresolution analysis point of view. Essentially, there is an interplay of three factors: 
(i) the DAF expansion of order M (which determines a father wavelet), (ii) a DAF 
window size $\sigma$ (which determines the scale or correlations), (iii) the DAF central 
frequency ~ $1/\Delta$ (which is equivalent to a dilation parameter). The best computational 
efficiency is achieved when, for a given M, the DAF window size is proportional to the 
central frequency, $\sigma \propto \Delta$. 

The last condition in our definition of DAFs reflects the fact that the most 
important DAFs we have discovered so far are delta sequences of the Dirichlet type. 
From the point of view of approximation theory, delta sequences of the Dirichlet type 
are, in general, more rapidly convergent than those of the positive type [39]. However, 
for providing unbiased approximations, the delta sequence of the Dirichlet type is 
applicable to a smaller class of functions than that of the positive type. Specifically, the 
function is required to be at least Hölder-continuous or $C^1$. This requirement is not 
needed for the DAF approximator because DAFs are of Schwartz class. An additional 
benefit of DAFs being of Dirichlet type is that DAFs are indeed "small waves" [2] and 
are readily admitted as father wavelets. Corresponding DAF mother wavelets can be 
generated by the techniques described in the last section. 

In parallel to the orthogonal wavelets and nonorthogonal wavelets, there are two 
classes of DAFs: orthogonal DAFs which are generated by orthogonal basis expansion 
of the delta distribution and nonorthogonal DAFs which are created by methods of 
regularization. These are discussed in the following two subsections. It is noted that 
both orthogonal DAFs and nonorthogonal DAFs are frames, rather than orthogonal father 
wavelets. 

A. DAFs Generated from Orthogonal Systems 
One of the most important DAFs is the Hermite DAF (HDAF). It was first 
introduced in a discrete form [25] by Hoffman, Kouri and coworkers for quantum 
dynamics and then in a continuous form, by Kouri, Hoffman and coworkers [26], as 

\[ \delta_{\rm HDAF}(x - x' \mid M, \sigma) = \frac{1}{\sigma} \exp\left(-\frac{(x - x')^2}{2\sigma^2}\right) \sum_{n=0}^{M/2} \left(-\frac{1}{4}\right)^{n} \frac{1}{\sqrt{2\pi}\, n!}\, H_{2n}\!\left(\frac{x - x'}{\sqrt{2}\,\sigma}\right), \tag{58} \]

where $H_{2n}$ are the usual even Hermite polynomials. While the 
Hermite expansion of the delta distribution, Eq. (25), was discussed by Schwartz [35] and 
Korevaar [36], the Hermite DAF is very different, at least where applications are 
concerned, because it is explicitly a two-parameter delta sequence. This has the 
consequence that the Hermite DAF $\delta_{\rm HDAF}(x - x' \mid M, \sigma)$ converges to the delta 
distribution either when the degree of the polynomial M goes to infinity, 

\[ \lim_{M \to \infty} \delta_{\rm HDAF}(x - x' \mid M, \sigma) = \delta(x - x') \tag{59} \]

independent of $\sigma \neq 0$, or in the limit $\sigma \to 0$, 

\[ \lim_{\sigma \to 0} \delta_{\rm HDAF}(x - x' \mid M, \sigma) = \delta(x - x') \tag{60} \]

independent of $0 < M < \infty$. Having two independent DAF parameters is crucial to its 

success in various numerical applications, because adjusting two DAF parameters leads 
to tunable accuracy within well-tempered regions. Without the introduction of the 
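dilation parameter $\sigma$ in the Hermite expansion of the delta distribution, Eq. (25) itself would 
have very little numerical utility. Well-tempered Hermite DAFs have been used for a 
variety of numerical applications, including among others, filtering and fitting 
experimental data and ab initio quantum mechanical potential surface values, padding 
data on two- and three-dimensional surfaces, resolving eigenvalues of a Hamiltonian, 
solving linear and nonlinear PDEs, and signal processing. Some of these applications 
involve estimating the derivatives of a function known only on a finite, discrete grid, 
which are given in our DAF method in the sense of a distribution [25], 

\[ f^{(l)}(x) = \left\langle \delta^{(l)}_{\rm HDAF}(x - x' \mid M, \sigma),\, f(x') \right\rangle \tag{61} \]
\[ = (-1)^{l} \left\langle \delta_{\rm HDAF}(x - x' \mid M, \sigma),\, f^{(l)}(x') \right\rangle, \tag{62} \]

where $\delta^{(l)}_{\rm HDAF}(x - x' \mid M, \sigma)$ is given by [25] 

\[ \delta^{(l)}_{\rm HDAF}(x - x' \mid M, \sigma) = \frac{2^{-l/2}}{\sigma^{l+1}} \exp\left(-\frac{(x - x')^2}{2\sigma^2}\right) \sum_{n=0}^{M/2} \left(-\frac{1}{4}\right)^{n} \frac{(-1)^{l}}{\sqrt{2\pi}\, n!}\, H_{2n+l}\!\left(\frac{x - x'}{\sqrt{2}\,\sigma}\right). \tag{63} \]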

As stated earlier, the Hermite DAF is a Schwartz class function for non-zero $\sigma$ and 
finite M, so the right hand side of Eq. (61) exists in the distribution sense even if $f^{(l)}$ is not well 
defined. This situation can occur in solving PDEs, for which Hermite DAFs can smooth 
out the discontinuities and "round off" singularities so as to provide numerical solutions 
(so called "weak solutions"). This is the case in solving the Fokker-Planck equation 
when the initial distribution function f is a Dirac delta distribution. Hermite DAFs have 
successfully estimated the first and second derivatives and integrated the Fokker-Planck 
equation to an $L^2$ error of the order of $10^{-12}$ [30]. 

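The Fourier space representation of the Hermite DAF is given by [27] 

\[ \hat{\delta}_{\rm HDAF}(k \mid M, \sigma) = \exp\left(-\frac{k^2\sigma^2}{2}\right) \sum_{n=0}^{M/2} \frac{1}{n!} \left(\frac{k^2\sigma^2}{2}\right)^{n}. \tag{64} \]

This has the important property that 

\[ \hat{\delta}_{\rm HDAF}(0 \mid M, \sigma) = 1 \qquad \forall\, \sigma \in (0, \infty),\ M = 1, 2, \ldots \tag{65} \]

Hence, Hermite DAFs are readily admissible as father wavelets. In parallel with Eqs. 
(59) and (60), it is easy to see that Hermite DAFs become a Fourier space all pass filter, 
i.e., $\hat{\delta}_{\rm HDAF}(k \mid M, \sigma) \to 1$, whenever $M \to \infty$ or $\sigma \to 0$. For a given set of Hermite 
DAF parameters ($M \neq \infty$, $\sigma \neq 0$), $\hat{\delta}_{\rm HDAF}(k \mid M, \sigma)$ is an infinitely smooth, exponentially 
decaying low pass filter which is extremely useful for signal analysis. 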
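
The low pass behavior described above can be illustrated with a short sketch. It assumes the Fourier-space form $\exp(-y)\sum_{n=0}^{M/2} y^n/n!$ with $y = k^2\sigma^2/2$ for the Hermite DAF; the function name `hdaf_fourier` and the parameter values are illustrative choices, not values taken from the text.

```python
import math

def hdaf_fourier(k, M, sigma):
    """Assumed Fourier-space Hermite DAF: exp(-y) * sum_{n=0}^{M/2} y^n / n!,
    with y = k^2 sigma^2 / 2 and M an even non-negative integer."""
    y = 0.5 * (k * sigma) ** 2
    term = 1.0
    total = 1.0
    for n in range(1, M // 2 + 1):
        term *= y / n          # builds y^n / n! incrementally
        total += term
    return math.exp(-y) * total

# Frequency response on k = 0, 0.5, ..., 15 for illustrative M = 20, sigma = 1.
response = [hdaf_fourier(0.5 * j, 20, 1.0) for j in range(31)]

# All-pass limits: small sigma, or large M, push the response towards one.
near_one_small_sigma = hdaf_fourier(3.0, 20, 0.01)
near_one_large_M = hdaf_fourier(3.0, 400, 1.0)
```

The response equals one at k = 0, stays flat over a plateau whose width grows with M and shrinks with $\sigma$, and then decays smoothly to zero, which is the tunable low pass behavior the two DAF parameters provide.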

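DAFs of Fejér type can be easily generated by using our Hermite DAFs 

\[ \delta_{\rm HDAF}^{\rm Fej\acute{e}r}(x - x' \mid M, \sigma) = \frac{2}{(M + 2)\,\sigma} \exp\left(-\frac{(x - x')^2}{2\sigma^2}\right) \sum_{m=0}^{M/2} \sum_{n=0}^{m} \left(-\frac{1}{4}\right)^{n} \frac{1}{\sqrt{2\pi}\, n!}\, H_{2n}\!\left(\frac{x - x'}{\sqrt{2}\,\sigma}\right). \tag{66} \]

A somewhat more general approach is to sum selectively 

\[ \delta_{\rm HDAF}^{\rm Fej\acute{e}r}(x - x' \mid M, M', \sigma) = \frac{2}{(M - M' + 2)\,\sigma} \exp\left(-\frac{(x - x')^2}{2\sigma^2}\right) \sum_{m=M'/2}^{M/2} \sum_{n=0}^{m} \left(-\frac{1}{4}\right)^{n} \frac{1}{\sqrt{2\pi}\, n!}\, H_{2n}\!\left(\frac{x - x'}{\sqrt{2}\,\sigma}\right). \tag{67} \]

We point out that since DAFs are multiparameter delta sequences, averaging over 
other DAF parameters will also generate DAFs of Fejér type. We shall not, 
however, give an exhaustive list of these possibilities here. 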

A particular, special case of our Hermite DAF (for M = 2) has been 
previously discussed by Monaghan and Gingold [47] in terms of a super Gaussian 
kernel, 

\[ \delta_{\rm HDAF}(x \mid 2, \sigma) = \frac{1}{\sqrt{2\pi}\,\sigma} \left(\frac{3}{2} - \frac{x^2}{2\sigma^2}\right) \exp\left(-\frac{x^2}{2\sigma^2}\right). \tag{68} \]

(Note that this is different from the Mexican hat wavelet.) These authors noted 
that this super Gaussian kernel gives interpolation error of $O(\sigma^4)$ in numerical 
integrations and performs better than the Gaussian kernel. 

More general DAFs based on orthogonal basis expansions can be 
constructed in two ways, as described in Ref. [28]. Essentially, for a given 
positively defined weight function w in a domain (a, b), it is possible, by using the 
standard three term recurrence method, to generate a set of polynomials $P_n$ which 
is orthonormal under the inner product with respect to the weight w. This results 
in very general orthonormal $L^2(a, b)$ bases. All classical polynomial systems, such 
as Hermite, Legendre, Jacobi, Chebyshev, etc. are special cases of this approach. 
This procedure of generating arbitrary orthonormal systems is well-known and has 
been employed by Shizgal [48] for constructing various reproducing kernels, 
which, when limited to the classical weight function are coincident with the 
discrete variable representation (DVR) [49]. If the weight function of a general 
orthonormal basis is chosen to be of the Schwartz class, a general DAF can then 
be constructed by expanding the delta distribution in the domain (a, b). The 
resulting DAF is an approximate identity kernel and is very similar to the 
reproducing kernel of Shizgal for a common weight w. However, there are 
important differences in the philosophies of the two approaches. The grid points 
in Shizgal's method are always restricted to the whole set of nodes of the 
polynomial of highest degree occurring in the expansion. As a result, both the 
function and its derivatives are approximated in a global manner, necessarily 
involving all grid points in the domain (a, b). A major advantage in the DAF 
approach is that there is no restriction on grid selection. It turns out that because 
of the nature of the Schwartz class DAFs, numerically, the DAF approximation to 
a function and its derivative at each point x is effectively a localized one, with 
contributions coming only from near neighbor points, x'. The range of the 
neighborhood is controlled by the DAF window parameter, $\sigma$ in the case of Hermite 
DAFs. The functional relation determining the contributions of neighboring grid 
points is given by a pointwise basis set expansion of the delta distribution and 
controlled by another independent DAF parameter. Therefore, the DAF approach 
is a pointwise spectral method and its matrix representation is banded. It is 
well-known that, in general, global methods are more accurate than local methods, 
while local methods are flexible for handling complex boundaries and geometries. 
Therefore, linear systems with simple boundary conditions are preferably solved 
by global methods, while nonlinear systems with complex boundaries are usually 
solved by local methods. However, in a wide range of numerical applications, 
such as long-term weather predicting, describing shock waves in compressible gas 
flow, or vortex sheets in high Reynolds number incompressible fluid flow, and 
studying critical points of Bose-Einstein condensations, one deals with nonlinear 
PDEs possessing singularities or phase space homoclinic orbits [11,12]. The 
accuracy of approximations to derivatives becomes particularly important because 
homoclinic orbit crossing can induce numerical chaos [12]. Hence, it is desirable 
to have an approach that has global method accuracy and local method flexibility 
for treating the above mentioned highly demanding systems. Spectral element 
approaches [50], which combine the spectral method with the finite element 
method, have been studied for this purpose. A general difficulty in this approach 
is the interfacial matching between various spectral subdomains. This leads to a 
great reduction in accuracy. We find that DAFs provide a robust alternative 
approach for solving such problems. 
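
The localized, banded character of the Hermite DAF approximation can be sketched as follows. This is an illustrative implementation under stated assumptions, not the authors' code: it assumes the coordinate-space Hermite DAF with physicists' Hermite polynomials $H_{2n}$, replaces the integral by a grid sum weighted by the spacing, and uses hypothetical names (`hdaf_kernel`, `hdaf_apply`) and illustrative parameters (spacing 0.2, $\sigma = 1$, M = 20, 60 neighbors on each side).

```python
import math

def hdaf_kernel(x, M, sigma, deriv=0):
    """Hermite DAF (deriv=0) or its first derivative (deriv=1) at offset x.
    M must be even. The derivative uses d/dy[exp(-y^2) H_k(y)] = -exp(-y^2) H_{k+1}(y)."""
    y = x / (math.sqrt(2.0) * sigma)
    # Physicists' Hermite polynomials H_0 .. H_{M+deriv} by three-term recurrence.
    h = [1.0, 2.0 * y]
    for m in range(1, M + deriv):
        h.append(2.0 * y * h[m] - 2.0 * m * h[m - 1])
    total = 0.0
    coeff = 1.0 / math.sqrt(2.0 * math.pi)      # (-1/4)^n / (sqrt(2 pi) n!) at n = 0
    for n in range(M // 2 + 1):
        total += coeff * h[2 * n + deriv]
        coeff *= -0.25 / (n + 1)
    scale = (-1.0) ** deriv / (2.0 ** (0.5 * deriv) * sigma ** (deriv + 1))
    return scale * math.exp(-y * y) * total

def hdaf_apply(f, x, delta_x, M, sigma, deriv=0, half_width=60):
    """Approximate f (or f') at x from samples f(j * delta_x), using only the
    2 * half_width + 1 grid points nearest to x (a banded sum)."""
    j0 = round(x / delta_x)
    total = 0.0
    for j in range(j0 - half_width, j0 + half_width + 1):
        xj = j * delta_x
        total += hdaf_kernel(x - xj, M, sigma, deriv) * f(xj)
    return delta_x * total

x0 = 0.33                                   # an off-grid point
f_approx = hdaf_apply(math.sin, x0, 0.2, 20, 1.0)
df_approx = hdaf_apply(math.sin, x0, 0.2, 20, 1.0, deriv=1)
print(abs(f_approx - math.sin(x0)), abs(df_approx - math.cos(x0)))
```

Because the kernel decays like a Gaussian, grid points beyond the chosen half width contribute negligibly, so each row of the corresponding matrix has only a fixed number of significant entries regardless of the total grid size.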

A second way to construct general DAFs is based on the variational 
principle also described in Ref. [28]. A cost function $\lambda$ is defined as 

(69) 

where w is a positive weight function, $\{\phi_j\}$ is a set of functions and $\{\alpha_j\}$ is a set of 
expansion coefficients to be determined by local minimization of $\lambda(\{\alpha_j\}; x)$. It 
turns out that for an arbitrary locally integrable function f, the set of coefficients $\{\alpha_j\}$ 
that minimizes $\lambda(\{\alpha_j\}; x)$ is the same as that which provides an orthogonal 
basis expansion of the delta distribution under the same weight w. We refer the 
reader to Ref. [28] for more details. It is noted that in this approach, each point 
can have its own basis set, independent of the basis sets used at neighboring 
points. This is equivalent to using a different basis for expanding at each grid 
point. Since this result is very general, we refer to it as the "DAF variational 
principle." This approach has been shown to be extremely powerful in molecular 
potential surface fitting [51], as well as in a variety of signal processing 
applications. Similar approaches exist in the mathematical literature; for example, 
moving least squares have been discussed by many authors [52]. 
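
The idea of a separate local minimization at each point can be sketched in the moving least squares spirit mentioned above. The sketch below is not the cost function of Eq. (69): it assumes a discrete weighted sum of squared residuals, a Gaussian weight, and a two-term basis {1, x' - x}, all chosen for illustration; `local_fit` is a hypothetical name.

```python
import math

def local_fit(xs, fs, x, sigma):
    """Minimise sum_l w(x_l - x) * (f_l - a0 - a1 * (x_l - x))**2 over (a0, a1)
    with an assumed Gaussian weight w, and return a0, the fitted value at x."""
    s0 = s1 = s2 = t0 = t1 = 0.0
    for xl, fl in zip(xs, fs):
        d = xl - x
        w = math.exp(-d * d / (2.0 * sigma * sigma))
        s0 += w
        s1 += w * d
        s2 += w * d * d
        t0 += w * fl
        t1 += w * d * fl
    # Solve the 2x2 normal equations [[s0, s1], [s1, s2]] [a0, a1]^T = [t0, t1]^T.
    det = s0 * s2 - s1 * s1
    return (s2 * t0 - s1 * t1) / det

xs = [0.1 * i for i in range(50)]
fs = [2.0 + 3.0 * xl for xl in xs]          # samples of a straight line
value = local_fit(xs, fs, 1.234, 0.3)       # evaluated at an off-grid point
print(value)
```

Each evaluation point x gets its own weighted fit, centered on x, which mirrors the statement that each point can have its own basis set independent of its neighbors.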

Orthogonal basis set expansions of the delta distribution have been 
described in very general form in Ref. [28]. A historical summary of orthogonal 
basis-derived DAFs can be found in the reference section of Ref. [28]. 
DAFs Generated by Nonorthogonal Systems 

1 . Fourier Space Regularization 

Definition 8. A tempered distribution on R is a linear mapping $T: S \to R$ 
such that for some positive integer N and a constant C 

\[ \left| T[f] \right| \le C \sum_{m+n \le N} \sup_{x} \left| x^{m}\, \partial^{n} f(x) \right| \qquad \forall f \in S. \tag{70} \]

The space of all tempered distributions on R is denoted as S'. 
Examples of tempered distributions are all polynomials, all $L^1$ and $L^2$ functions and 
even the periodic delta distribution ^if.—^o £ (Y~2flk) . However, "neither 
— - £ fy-i*fc) nor e^ 1 are tempered. Since DAFs are functions in the 
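Schwartz class, tempered distributions are particularly important to DAF 

It is obvious that $\int \hat{w}_\sigma(\xi)\, d\xi = 1$ for all $\sigma$. We construct the Shannon Gabor 
wavelet-DAF as 

\[ \hat{\delta}_{\rm SGWD}(\xi \mid \sigma, \eta) = \left(\hat{T}_\eta * \hat{w}_\sigma\right)(\xi). \tag{74} \]

This is a smoothed function, which has the property that 

\[ \hat{\delta}_{\rm SGWD}(0 \mid \sigma, \eta) = 1. \tag{75} \]

The function $\hat{w}_\sigma(\xi)$ is a regularizer and 

\[ \hat{\delta}_{\rm SGWD}(\xi \mid \sigma, \eta) \to \hat{T}_\eta(\xi) \tag{76} \]

as $\sigma \to \infty$. We thus recover exactly the ideal low pass filter function in 
the limit of our regularization. Moreover, for a given $\sigma$, 

\[ \lim_{\eta \to \infty} \hat{\delta}_{\rm SGWD}(\xi \mid \sigma, \eta) = 1 \qquad \forall\, \xi \in R, \tag{77} \]

as required for DAFs. 

The inverse Fourier transform gives the Shannon Gabor wavelet-DAF [33] 

\[ \delta_{\rm SGWD}(x \mid \sigma, \eta) = \mathcal{F}^{-1}\!\left[\hat{\delta}_{\rm SGWD}(\xi \mid \sigma, \eta)\right](x) = 2\pi\, T_\eta(x)\, w_\sigma(x) = \frac{\sin(\eta x)}{\pi x}\, e^{-x^2/(2\sigma^2)}. \tag{78} \]

This is still a two-parameter DAF such that 

\[ \lim_{\eta \to \infty} \delta_{\rm SGWD}(x \mid \sigma, \eta) = \delta(x) \qquad \forall\, \sigma, x \in R \tag{79} \]
\[ \lim_{\sigma \to 0} \delta_{\rm SGWD}(x \mid \sigma, \eta) = \delta(x) \qquad \forall\, \eta, x \in R. \tag{80} \]

The Shannon Gabor wavelet-DAF can be regarded as a Gaussian regularized 
Dirichlet continuous delta sequence in the coordinate representation. 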

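In numerical applications we choose $\eta = \pi/\Delta$ for a discrete 
version of the Shannon Gabor wavelet-DAF 

\[ \delta_{\rm SGWD}(x - x_n \mid \sigma, \Delta) = \frac{1}{\Delta}\, \frac{\sin\left[\frac{\pi}{\Delta}(x - x_n)\right]}{\frac{\pi}{\Delta}(x - x_n)}\, e^{-(x - x_n)^2/(2\sigma^2)}. \tag{81} \]

This form has the interpolating property. 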

Example 2. Generalized de la Vallee Poussin DAF 
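We choose $\hat{T}$ as the following 

\[ \hat{T}_{\eta,\lambda}(\xi) = \begin{cases} 1 & |\xi| < \eta \\[0.5ex] \dfrac{\lambda\eta - |\xi|}{(\lambda - 1)\,\eta} & \eta \le |\xi| \le \lambda\eta \\[1ex] 0 & \text{otherwise} \end{cases} \tag{82} \]

where $\lambda > 1$. Let $\hat{\delta} = \hat{T}_{\eta,\lambda} * \hat{w}_\sigma$ with $\hat{w}_\sigma$ given by Eq. (73). Then we can 
construct a corresponding DAF by 

\[ \delta(x \mid \eta, \lambda, \sigma) = \mathcal{F}^{-1}\!\left[\hat{T}_{\eta,\lambda} * \hat{w}_\sigma\right] = 2\pi\, T_{\eta,\lambda}(x)\, w_\sigma(x) = \frac{1}{\pi}\, \frac{\cos(\eta x) - \cos(\lambda\eta x)}{(\lambda - 1)\,\eta\, x^2}\, e^{-x^2/(2\sigma^2)}. \tag{83} \]

This DAF reduces to the Shannon Gabor wavelet-DAF, Eq. (78), in the limit of 
$\lambda \to 1$ and to the de la Vallée Poussin DAF [32] when $\lambda = 2$. The 
latter has been tested for some numerical applications, including solving PDEs 
[32]. 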

Example 3. Arbitrary Schwartz class filters 

As discussed in Ref. [53], to design a general, smoothed filter, which can 
be low pass, high pass, band pass or band stop, we choose a Schwartz class 
function $\hat{w} \in S$ which satisfies 

\[ \int \hat{w}(\xi)\, d\xi = 1. \tag{84} \]

Its Fourier inverse satisfies 

\[ 0 \le w(x) \le 1 \qquad \forall\, x \in R. \tag{85} \]

Equation (84) implies that w(0) = 1. Let $\hat{T}_{\alpha_1,\alpha_2,\ldots}(\xi)$ be a piecewise 
smooth function characterized by a set of parameters $\{\alpha_i\}$ and satisfying 

\[ 0 \le \hat{T}_{\alpha_1,\alpha_2,\ldots}(\xi) \le 1 \qquad \forall\, \xi \in R. \tag{86} \]

Then we have a generalized Schwartz class filter 

\[ \delta_{\rm DAFF}(x) = \mathcal{F}^{-1}\!\left[\hat{T} * \hat{w}\right](x) = 2\pi\, T(x)\, w(x). \tag{87} \]

This obviously includes Example 1 and Example 2 as special cases. 

Another interesting special case is given by choosing 

\[ \hat{T}_{\alpha,\beta}(\xi) = \begin{cases} 1 & \beta < |\xi| < \alpha, \quad \alpha > \beta > 0 \\ 0 & \text{otherwise} \end{cases} \tag{88} \]

and $\hat{w}_\sigma$ given by Eq. (73). We then have a smoothed high pass filter 

\[ \delta_{\rm HPF}(x \mid \sigma, \alpha, \beta) = \frac{1}{\pi x}\left[\sin(\alpha x) - \sin(\beta x)\right] e^{-x^2/(2\sigma^2)}. \tag{89} \]

We note that this is equivalent to a DAF-wavelet. Equation (89) reduces to the 
Shannon Gabor wavelet-DAF when $\beta \to 0$. 

The method presented in this section is very general for the construction of 
arbitrary smooth filters and DAFs. (The desired degree of smoothness is obtained 
by appropriately choosing the regularizer.) We refer the reader to Ref. [53] for 
more details. 

2. Interpolating Formulae and DAFs 

Since a Fourier space convolution leads to a coordinate space product for 
a certain class of functions, it is clear that regularization in Fourier space by a 
Schwartz class function is equivalent to a coordinate space regularization by a 
Schwartz class function. Therefore, coordinate space S-function regularization of 
commonly used interpolating kernels leads to smoothed kernels, or DAFs. The 
most well-known such kernel is the Lagrange interpolating formula 

\[ L_n(f, x) = \sum_{i=1}^{n} l_i(x)\, f(x_i) \qquad \forall\, x_i \in (a, b), \tag{90} \]

where $l_i(x)$ is the well-known Lagrange kernel given by 

\[ l_i(x) = \prod_{j=1,\, j \neq i}^{n} \frac{x - x_j}{x_i - x_j} \qquad \forall\, x, x_j \in (a, b). \tag{91} \]

(Note that by using the summation convention, Eq. (90), all kernels and DAFs 
presented in this and the next subsections differ from those in the previous sections 
by a factor of $1/\Delta$, where $\Delta$ is the grid spacing.) Lagrange interpolating 
formulae yield polynomial approximations to f(x) when its values are given on a 
set of n node points $\{x_i\}$. If the nodes are chosen to be equidistant, 
$x_i = a + (i - 1)(b - a)/(n - 1)$, i = 1, 2, ..., n, then integration gives the famous Newton-Cotes 
quadrature formulae. We have constructed Lagrange DAFs by introducing a 
Gaussian weight to the Lagrange interpolating formulae 

\[ \delta_{\rm LDAF}(x - x_i \mid \sigma) = \prod_{j=i-M,\, j \neq i}^{i+M} \frac{x - x_j}{x_i - x_j}\; e^{-(x - x_i)^2/(2\sigma^2)}. \tag{92} \]

The most important difference between our Lagrange DAF and the Lagrange 
interpolating formulae is the way in which nodal points $\{x_j\}$ are chosen. The 
nodes of the polynomials are always chosen around each central point 
$x_i \in (a, b)$. Therefore when $x_i$ is close to the boundary a or b, some nodal 
points will fall outside of (a, b). This causes our Lagrange DAF 
approximation to have translational symmetry and be asymptotically unbiased in 
the whole computational domain (a, b). This way of choosing grid points is due 
to the fact that DAFs are essentially a $(-\infty, \infty)$ domain method rather than a 
bounded (a, b) domain method. This reflects the nature of the Schwartz space 
weight function. However, being a $(-\infty, \infty)$ domain method does not mean we 
have to use the nodes at $\pm\infty$, and it also does not mean that we cannot use 
it for finite domain (a, b) calculations. On the contrary, the Lagrange DAF is 
more flexible for arbitrary finite domain computations, because the S-space weight 
function makes our Lagrange DAF kernel an effectively localized one. The 
Lagrange DAF matrix is effectively banded. In principle, only a portion (2M + 1) 
of all the grid points which are closest to $x_i$ must be used in constructing an 
approximation to the kernel $\delta_{\rm LDAF}(x - x_i \mid \sigma)$. If the nodes are chosen to be 
equally spaced, then we only need to calculate $\delta_{\rm LDAF}(x - x_i \mid \sigma)$ once for all $x_i$. 
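
A minimal sketch of a Gaussian-weighted Lagrange kernel on a uniform grid follows. It assumes the kernel is the local Lagrange product over the 2M + 1 nodes centered on node i, multiplied by a Gaussian in the distance to that node; the function names and the parameter values are illustrative, and the sketch only demonstrates the interpolating property at the nodes, not off-grid accuracy.

```python
import math

def lagrange_daf_kernel(x, i, delta_x, M, sigma):
    """Gaussian-weighted Lagrange kernel centred on node x_i = i * delta_x,
    built from the 2M + 1 nodes x_{i-M}, ..., x_{i+M}."""
    xi = i * delta_x
    prod = 1.0
    for j in range(i - M, i + M + 1):
        if j != i:
            xj = j * delta_x
            prod *= (x - xj) / (xi - xj)
    return prod * math.exp(-(x - xi) ** 2 / (2.0 * sigma * sigma))

def lagrange_daf_interp(f, x, delta_x, M, sigma):
    """Approximate f(x) from samples f(i * delta_x) using the 2M + 1 nodes
    nearest to x; nodes are allowed to lie outside any bounded interval."""
    i0 = round(x / delta_x)
    return sum(f(i * delta_x) * lagrange_daf_kernel(x, i, delta_x, M, sigma)
               for i in range(i0 - M, i0 + M + 1))

dx, M, sig = 0.1, 8, 0.25                   # illustrative values only
at_center = lagrange_daf_kernel(3 * dx, 3, dx, M, sig)
at_neighbors = [lagrange_daf_kernel(m * dx, 3, dx, M, sig)
                for m in range(3 - M, 3 + M + 1) if m != 3]
node_value = lagrange_daf_interp(math.cos, 5 * dx, dx, M, sig)
```

On a uniform grid the kernel depends only on the offset x - x_i, so the same stencil of weights can be reused at every node, which is the translational symmetry noted above. Off-grid accuracy depends on how M and the ratio of $\sigma$ to the spacing are chosen and is not asserted by this sketch.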

More general interpolating DAFs can be constructed by using more general 
Lagrange interpolations. Let $\{l_i(x)\}$, i = 1, 2, ..., n, be a set of functions 
satisfying 

\[ l_i(x_j) = \delta_{ij}. \tag{93} \]

As a result, the generalized Lagrange formulae 

\[ L_n(f, x) = \sum_{i=1}^{n} l_i(x)\, f(x_i) \tag{94} \]

will have the interpolating property, so that 

\[ L_n(f, x_m) = f(x_m), \qquad m = 1, 2, \ldots, n, \]

provided that the $x_i$ are n distinct nodes. A very efficient way for obtaining 
$l_i(x)$ of the Lagrange type is to choose a nodal function $\mu_n(x)$ and a basis 
function $\nu_i(x)$ defined in an open interval around every node, and which have 
at least the first derivative at every node, satisfying 

\[ \begin{aligned} \mu_n(x_i) &= 0, \quad 0 < |\mu_n'(x_i)| < \infty \qquad \forall\, i = 1, 2, \ldots, n \\ \nu_i(x_i) &= 0, \quad 0 < |\nu_i'(x_i)| < \infty \qquad \forall\, i = 1, 2, \ldots, n \\ \nu_i(x_m) &\neq 0 \qquad \forall\, i \neq m. \end{aligned} \tag{95} \]

The generalized Lagrange kernels are then obtained by making 

\[ l_i(x) = \frac{\mu_n(x)\, \nu_i'(x_i)}{\mu_n'(x_i)\, \nu_i(x)}. \tag{96} \]

This satisfies Eq. (93) and consequently Eq. (94). A corresponding generalized 
Lagrange DAF can be constructed as 

\[ \delta_i(x) = l_i(x)\, w_i(x), \tag{97} \]

where $w_i \in S$ is a Schwartz class function such that 

\[ 0 \le w_i(x) \le 1 \quad \text{and} \quad w_i(x_i) = 1 \qquad \forall\, x \in R. \tag{98} \]

Eq. (97) is a general expression for a large class of interpolating DAFs. 
Example 1. Lagrange interpolation 
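The Lagrange kernel, Eq. (91), is obtained by choosing 

\[ \mu_n(x) = \prod_{j=1}^{n} (x - x_j) \quad \text{and} \quad \nu_i(x) = x - x_i, \qquad i = 1, 2, \ldots, n; \]

then 

\[ l_i(x) = \prod_{j=1,\, j \neq i}^{n} \frac{x - x_j}{x_i - x_j}. \tag{99} \]

The corresponding DAF is given in Eq. (92). 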
Example 2. Cardinal interpolating 

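Considering a set of equally spaced nodes $\{x_i\} = \{0, \pm\Delta, \pm 2\Delta, \ldots, \pm\infty\}$ and 

\[ \mu_n(x) = \sin\left(\frac{\pi}{\Delta} x\right) \quad \text{and} \quad \nu_i(x) = x - i\Delta, \qquad i = 0, \pm 1, \pm 2, \ldots, \pm\infty; \]

then 

\[ l_i(x) = \frac{(-1)^{i} \sin\left(\frac{\pi}{\Delta} x\right)}{\frac{\pi}{\Delta}(x - i\Delta)}, \qquad i = 0, \pm 1, \pm 2, \ldots, \pm\infty. \tag{100} \]

This is a different form of the well-known sinc interpolating formulae and a 
corresponding DAF can be constructed as 

\[ \delta_{\rm SGWD}(x - x_i \mid \sigma) = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_i)\right]}{\frac{\pi}{\Delta}(x - x_i)}\, e^{-(x - x_i)^2/(2\sigma^2)}, \qquad x_i = 0, \pm\Delta, \pm 2\Delta, \ldots, \pm M\Delta. \tag{101} \]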

This is a different form of the Shannon-Gabor wavelet-DAF. As in the Lagrange 
DAF case, we only include 2M + 1 nodes in the calculation. Later in this 
subsection, we show that this is also a special case of a generalized Lagrange DAF 
(derived for evenly distributed nodes). 



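Example 3. Trigonometric interpolation 

Let 

\[ \nu_i(x) = \sin\left(\frac{1}{2}(x - x_i)\right), \quad i = 1, 2, \ldots, n, \qquad \mu_n(x) = \prod_{i=1}^{n} \nu_i(x); \]

then 

\[ l_i(x) = \frac{\mu_n(x)}{2\, \mu_n'(x_i)\, \sin\left[\frac{1}{2}(x - x_i)\right]} \tag{102} \]

is an interpolating kernel. Obviously it has the general form of a Lagrange 
interpolation. We construct a corresponding DAF as 

\[ \delta(x - x_i \mid \sigma) = \frac{\mu_n(x)}{2\, \mu_n'(x_i)\, \sin\left[\frac{1}{2}(x - x_i)\right]}\, e^{-(x - x_i)^2/(2\sigma^2)}. \tag{103} \]

Eq. (102) is simplified for the case of 2n + 1 equally spaced nodes 
to give 

\[ l_i(x) = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_i)\right]}{(2n + 1)\, \sin\left[\frac{\pi}{\Delta}\, \frac{x - x_i}{2n + 1}\right]}. \tag{104} \]

This is recognized as the Dirichlet kernel arising in Fourier analysis. The 
corresponding DAF can be written as 

\[ \delta_{\rm DGWD}(x - x_i \mid \sigma) = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_i)\right]}{(2n + 1)\, \sin\left[\frac{\pi}{\Delta}\, \frac{x - x_i}{2n + 1}\right]}\, e^{-(x - x_i)^2/(2\sigma^2)}. \tag{105} \]

This is our Dirichlet-Gabor-wavelet-packet-DAF, which was introduced when the 
connection between DAFs and wavelets was first made [32]. As pointed out in 
Ref. [62], the Shannon-Gabor-wavelet-DAF can be regarded as a special case of Eq. 
(105) because 

\[ \delta_{\rm SGWD}(x - x_i \mid \sigma) = \lim_{n \to \infty} \frac{\sin\left[\frac{\pi}{\Delta}(x - x_i)\right]}{(2n + 1)\, \sin\left[\frac{\pi}{\Delta}\, \frac{x - x_i}{2n + 1}\right]}\, e^{-(x - x_i)^2/(2\sigma^2)} = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_i)\right]}{\frac{\pi}{\Delta}(x - x_i)}\, e^{-(x - x_i)^2/(2\sigma^2)}. \tag{106} \]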

All of the examples in this subsection have been tested by numerical applications. 
They all have similar levels of numerical accuracy for interpolating functions and 
for numerically solving linear and nonlinear PDEs. It is noted that the prescription 
given in Eqs. (95) and (96) is very general and efficient for constructing 
interpolation kernels. One can easily write down as many more interpolation 
expressions as are desired. As a consequence, one can generate various 
generalized Lagrange DAFs by using Eq. (97) with S-class weights satisfying Eq. 
(98). It is beyond the scope of this paper to give an exhaustive list of all 
possibilities. (It is also true that there are infinitely many orthogonal series 
expansion DAFs as noted in Ref [28].) 

The connection between interpolation DAF and the "DAF variational 
purposes. This idea led us to introduce Gaussian regularized Lagrange sampling in Ref. 
[57]. 

Example 1. Shannon's sampling 

Shannon's sampling theorem is one of the most important mathematical 
foundations for information theory and various engineering fields. It even influences 
statistical mechanics and serves as an important motivation for wavelet development. 
Shannon's sampling theorem addresses the issue of constructing or recovering a 
continuous function f(x) on the real line $x \in R$ from an infinite, discrete set of known 
values $\{f(x_n)\}$ 

\[ f(x) = \sum_{n=-\infty}^{\infty} f(x_n)\, \frac{\sin \eta(x - x_n)}{\eta(x - x_n)}, \qquad f \in B_\eta^2, \tag{107} \]

where $B_\eta^2$ is the Paley-Wiener space of functions band limited to $\eta$, i.e., their 
momentum representations are identically zero outside the band $\eta$. This theorem 
provides connections between experimental measurement (which is discrete in nature) 
and theoretical predictions (which are continuous in nature). The reproducing kernel 

\[ \frac{\sin \eta(x - y)}{\eta(x - y)} = \sum_{n=-\infty}^{\infty} \frac{\sin \eta(x - x_n)}{\eta(x - x_n)}\, \frac{\sin \eta(y - x_n)}{\eta(y - x_n)} \tag{108} \]

is related to Whittaker's cardinal series 

\[ \frac{\sin \eta(x - x_n)}{\eta(x - x_n)} = \frac{(-1)^{n} \sin \eta x}{\eta x - n\pi} \tag{109} \]

and to the Dirichlet continuous delta sequence, Eq. (20), for appropriate choice of $\eta$. It 
is also known for generating an orthonormal basis for the reproducing kernel Hilbert 
space $B_\eta^2$. By setting $\eta = \pi$, one obtains the well-known Shannon's father wavelet, 

\[ \phi(x) = \frac{\sin \pi x}{\pi x}, \tag{110} \]

with the Fourier transform $\hat{\phi}$ given by the characteristic function of the corresponding 
band, and the mother wavelet $\psi(x) = [\sin(2\pi x) - \sin(\pi x)]/(\pi x)$, as discussed in Section III. 

Shannon's wavelets are not efficient from a computational point of view because 
of their slow decay as x becomes large. This is implied from the ideal lowpass property. 
Mathematical sampling theory emphasizes the fact that expression (107) is exact. 
However, in the real world, since one cannot actually use infinitely many sampling 
points, the "exactness" in Eq. (107) is not physically realizable. Moreover, there is a 
well-known paradox [58] regarding the notion of band-limited signals. The usual 
definition implies that a band-limited signal is an entire function, whose Fourier 
transform has compact support. However, an entire function cannot have compact 
support unless it is identically zero in the entire domain. Therefore, it cannot be both 
band-limited and time-limited, unless it is identically zero. This is in contrast to the fact 
that physically realizable states are well-behaved Schwartz-space functions, which are 
effectively, but not exactly, both bandlimited and time-limited [59]. The bandwidths in 
the frequency and time domains are actually related by the Heisenberg uncertainty 
principle. To construct a practically useful sampling formula which does not demand 
infinitely many sampling points while providing as high accuracy as desired for an 
application, we employ the regularization procedure discussed above and choose 

\[ w_\sigma(x) = e^{-x^2/(2\sigma^2)} \tag{111} \]

to smooth out the Gibbs oscillations and consequently reduce truncation errors. This 
leads us to define an interesting Shannon-Gabor father wavelet (SGFW) as 

\[ \phi_\sigma(x) = \frac{\sin \pi x}{\pi x}\, e^{-x^2/(2\sigma^2)}. \tag{112} \]

Note that the Shannon-Gabor father wavelet is different from either the Shannon father 
wavelet $\phi$ or the Gabor wavelet $e^{-x^2/(2\sigma^2)}\cos(ax)$. It is interesting to examine 
two limiting cases of the Shannon-Gabor father wavelet: 

\[ \lim_{\sigma \to \infty} \phi_\sigma(x) = \frac{\sin \pi x}{\pi x} \tag{113} \]

and 

\[ \lim_{\sigma \to 0} \frac{1}{\sqrt{2\pi}\,\sigma}\, \frac{\sin \pi x}{\pi x}\, e^{-x^2/(2\sigma^2)} = \delta(x). \tag{114} \]

Shannon's father wavelet is recovered in the first limit (provided $\sigma \to \infty$). The second 
limit, Eq. (114), indicates that the Shannon-Gabor father wavelet is a delta sequence. 
Using this fact, we can construct an interpolating Shannon-Gabor distributed 
approximating functional (DAF) [33] 

\[ \delta_{\rm DAF}(x - x_n \mid \sigma) = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_n)\right]}{\frac{\pi}{\Delta}(x - x_n)}\, e^{-(x - x_n)^2/(2\sigma^2)}. \tag{115} \]

The advantage of this expression over Shannon's reproducing kernel, Eq. (108), is that 
Eq. (115) is effectively banded. The best $\sigma$ values for numerical purposes are 
determined by the dilation factor $\Delta$ (grid spacing). For a given $\Delta$, there is a wide range 
of $\sigma$'s that deliver excellent numerical results. This Shannon-Gabor wavelet-DAF has 
been tested on many numerical applications and is extremely accurate and robust for 
numerical solutions of linear and nonlinear partial differential equations. 
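
The effect of the Gaussian regularization on truncation error can be sketched by comparing a truncated Shannon (sinc) sum with its Gaussian-weighted counterpart using the same number of sampling points. The kernel form (a sinc in $(x - x_n)/\Delta$ times a Gaussian of width $\sigma$) is assumed from the description above; the test function, spacing 0.5, $\sigma = 1.5$, and 30 neighbors per side are illustrative choices, and the function names are hypothetical.

```python
import math

def sinc_kernel(t, delta_x):
    """Shannon kernel sin(pi t / delta_x) / (pi t / delta_x)."""
    a = math.pi * t / delta_x
    return 1.0 if a == 0.0 else math.sin(a) / a

def sg_daf_kernel(t, delta_x, sigma):
    """Gaussian-regularised sinc kernel (assumed Shannon-Gabor DAF form)."""
    return sinc_kernel(t, delta_x) * math.exp(-t * t / (2.0 * sigma * sigma))

def interpolate(f, x, delta_x, half_width, kernel):
    """Sum f(x_n) * kernel(x - x_n) over the 2 * half_width + 1 nodes nearest x."""
    n0 = round(x / delta_x)
    return sum(f(n * delta_x) * kernel(x - n * delta_x)
               for n in range(n0 - half_width, n0 + half_width + 1))

delta_x, sigma, half_width = 0.5, 1.5, 30
x = 0.8137                                   # an off-grid point
err_shannon = abs(interpolate(math.sin, x, delta_x, half_width,
                              lambda t: sinc_kernel(t, delta_x)) - math.sin(x))
err_sg_daf = abs(interpolate(math.sin, x, delta_x, half_width,
                             lambda t: sg_daf_kernel(t, delta_x, sigma)) - math.sin(x))
print(err_shannon, err_sg_daf)
```

The unweighted sum converges only like the reciprocal of the number of retained terms, whereas the Gaussian weight makes the discarded tail negligible, so the same finite stencil gives a far smaller error for this smooth, well-sampled test function.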

Example 2. Generalized Lagrange Sampling 

Perhaps the most general sampling theorem is due to Paley and Wiener [60]. For 
an Lj function / which is band-limited to T|, its value fix) at an arbitrary point x can be 
exactly recovered from an infinite sent of (not necessarily uniform) discrete "sampling 
points" [x k ] 

(Note that this implies a value of A k , one then constructs the following- Lagrange-type 
interpolating series 



/(«) = E /(«*)&(«) (» € R)y { f (H7) 

where 



is a Lagrange-type sampling function. Here G(jc) is an entire function given by 
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GM=(x- Io) n(i-^)(i-^). 



(U9) 



and $G'$ denotes the derivative of $G$. Equation (117) is called the Paley and Wiener
sampling theorem in the mathematical literature and can be regarded as a generalization
of the classical Lagrange interpolation formula to the real line ($\mathbb{R}$) for functions of the
exponential type. Unlike the classical Lagrange interpolation formula, Eq. (117) contains
infinitely many terms, and we stress that it yields the exact $f(x)$ for all real $x$. Thus, the
interesting point is that the information of a continuous function (containing a compact
set of frequencies) on the real line ($\mathbb{R}$) can be entirely embedded in an infinite, but
discrete, irregularly placed set of sampling points (grid points). Condition (116) is the
best one can have [60]. There will be an aliasing error if the grid mesh is larger than is
allowed by condition (116) or if the function $f(x)$ is not band-limited to $\pi$. The major
disadvantage of Eq. (117) is that it converges slowly. In practice, neither computational
nor experimental data can ever be obtained at an infinite set of discrete sampling points.
As noted above, from a mathematical point of view, a band-limited (i.e., compact support
in Fourier space) function cannot have compact support in the coordinate representation
unless it is identically zero. From a physical point of view, physical measurements
cannot be conducted for an infinite duration; therefore, physically realizable states are the
Schwartz-class functions [59], which can be treated as effectively band-limited in both
the momentum and coordinate representations. This suggests that one can truncate Eq.
(117) and still obtain reasonable results. A simple way of achieving this is to introduce
a weight function $w_k(x)$ into the right hand side of Eq. (117). (This approach can be made
rigorous by introducing the regularization in the momentum space, as discussed above.)
This leads to the approximate equation

$$f(x) \approx \sum_{k=-\infty}^{\infty} f(x_k)\, S_k(x)\, w_k(x). \tag{120}$$



A particularly robust weight function on the real line $\mathbb{R}$ is the Gaussian

$$w_k(x) = e^{-\frac{(x - x_k)^2}{2\sigma_k^2}}, \qquad (0 < \sigma_k < \infty). \tag{121}$$

Note that the approximate equation (120) becomes exact in the limit that $\sigma_k$ approaches
infinity. Moreover, as $\sigma_k$ tends to zero, $S_k(x) w_k(x)$ behaves like a "semi-continuous" Dirac
delta function,

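$$\lim_{\sigma_k \to 0^+} \frac{1}{\sqrt{2\pi}\,\sigma_k}\, \frac{G(x)}{G'(x_k)(x - x_k)}\, e^{-\frac{(x - x_k)^2}{2\sigma_k^2}} = \frac{G(x)}{G'(x_k)(x - x_k)}\, \delta(x - x_k). \tag{122}$$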

This is effectively a delta function because

$$\lim_{x \to x_k} \frac{G(x)}{G'(x_k)(x - x_k)} = 1. \tag{123}$$

Therefore, if the sum over the kernel $S_k(x) w_k(x)$ in equation (120) can be approximated as an integral over
$x_k$, then one also obtains exact results as $\sigma_k \to 0^+$. For a finite set of sampling points
$\{x_k\}_{k=1}^{M}$ which are distributed in the nearest neighbor region of the point $x$, we have the
following LDAF expression [29]

$$\delta_{\rm LDAF}(x - x_k \mid M, \sigma_k) = \prod_{i=1,\, i \neq k}^{M} \frac{x - x_i}{x_k - x_i}\; e^{-\frac{(x - x_k)^2}{2\sigma_k^2}}. \tag{124}$$
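
A hedged sketch of how a Lagrange DAF of the form of Eq. (124) might be evaluated on an irregular grid is given below. The neighbourhood rule (take the $M$ grid points nearest to $x$), the single common width `sigma`, the jittered grid, and all function names are assumptions of this example rather than details stated in the text.

```python
import numpy as np

def lagrange_daf_weight(x, nodes, k, sigma_k):
    """Lagrange basis polynomial for node k times a Gaussian, as in Eq. (124)."""
    xk = nodes[k]
    others = np.delete(nodes, k)
    lagrange = np.prod((x - others) / (xk - others))
    return lagrange * np.exp(-(x - xk) ** 2 / (2.0 * sigma_k ** 2))

def ldaf_approximate(x, grid, samples, M, sigma):
    """Approximate f(x) from the M grid points nearest to x (assumed rule)."""
    idx = np.argsort(np.abs(grid - x))[:M]
    nodes = grid[idx]
    return sum(samples[idx[k]] * lagrange_daf_weight(x, nodes, k, sigma)
               for k in range(M))

# Irregular (jittered) sampling points; illustrative values only
rng = np.random.default_rng(0)
base = np.arange(-4.0, 4.0 + 1e-9, 0.25)
grid = base + rng.uniform(-0.05, 0.05, base.size)
samples = np.sin(grid)

x_test = np.linspace(-1.0, 1.0, 41)
approx = np.array([ldaf_approximate(x, grid, samples, M=8, sigma=2.0)
                   for x in x_test])
max_err = np.max(np.abs(approx - np.sin(x_test)))

# The kernel is interpolating: at a grid point the sample is returned
node_value = ldaf_approximate(grid[16], grid, samples, M=8, sigma=2.0)
```

At a sampling point every Lagrange factor belonging to another node vanishes and the Gaussian equals one, so the sample value is reproduced; between sampling points the Gaussian only damps the contribution of distant nodes.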

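The Lagrange sampling theorem is very general and it includes Shannon's sampling as
a special case of $x_k = k\Delta = -x_{-k}$. Then it is seen that Eq. (119) becomes

$$G(x) = x\prod_{k=1}^{\infty}\left(1 - \frac{x}{k\Delta}\right)\left(1 + \frac{x}{k\Delta}\right) \tag{125}$$

$$= x\prod_{k=1}^{\infty}\left(1 - \frac{x^2}{k^2\Delta^2}\right) \tag{126}$$

$$= \frac{\Delta}{\pi}\sin\frac{\pi x}{\Delta}. \tag{127}$$

Taking account of $G'(x_k) = (-1)^k$, equation (118) gives rise to

$$S_k(x) = \frac{(-1)^k \sin\frac{\pi x}{\Delta}}{\frac{\pi}{\Delta}(x - k\Delta)} \tag{128}$$

$$= \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{\frac{\pi}{\Delta}(x - x_k)}. \tag{129}$$

Since the derivation is independent of the regularizer $w_k$, it follows that our Shannon-
Gabor-wavelet-DAF can be regarded as a special case of our Lagrange DAF.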
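
The reduction from the Lagrange form to the Shannon form can be checked numerically. The sketch below assumes the sampling function has the form $S_k(x) = G(x)/[G'(x_k)(x - x_k)]$ with uniform points $x_k = k\Delta$; the truncation length `K` and the test values are arbitrary choices for illustration.

```python
import numpy as np

def G_truncated(x, delta, K):
    """Product x * prod_{k=1..K} (1 - x^2 / (k*delta)^2), truncated at K terms."""
    k = np.arange(1, K + 1)
    return x * np.prod(1.0 - (x / (k * delta)) ** 2)

def G_closed(x, delta):
    """Closed form of the infinite product: (delta/pi) * sin(pi*x/delta)."""
    return delta / np.pi * np.sin(np.pi * x / delta)

def S_lagrange(x, k, delta):
    """Lagrange-type sampling function for x_k = k*delta, using G'(x_k) = (-1)^k."""
    return G_closed(x, delta) / ((-1.0) ** k * (x - k * delta))

def S_shannon(x, k, delta):
    """Shannon sampling function sin[(pi/delta)(x - x_k)] / [(pi/delta)(x - x_k)]."""
    return np.sinc((x - k * delta) / delta)

product_gap = abs(G_truncated(0.3, 1.0, 200000) - G_closed(0.3, 1.0))
kernel_gap = abs(S_lagrange(0.37, 2, 0.5) - S_shannon(0.37, 2, 0.5))
```

The truncated product converges slowly (the relative error decays roughly like $x^2/(K\Delta^2)$), which mirrors the slow convergence of the untruncated sampling series noted earlier.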

Example 3. Dirichlet (periodic) sampling 

Both the Lagrange-sampling and Shannon-sampling theorems hold for band-limited
functions on the entire real axis $\mathbb{R}$. One might wonder what happens for a band-limited
periodic function. It is reasonable to expect a significant reduction in the number of
sampling points required because of the periodicity. This is indeed the case. Stark [61] has
proved that if a function $f(x)$ satisfies the Dirichlet boundary condition, is periodic in $T$
and band-limited to the highest (radial) frequency $2\pi M/T$, then it can be completely
reconstructed from a finite ($2M+1$) set of discrete sampling (grid) points [61]

$$f(x) = \sum_{k=-M}^{M} f(x_k)\, \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{(2M+1)\sin\left[\frac{\pi}{\Delta}\frac{x - x_k}{2M+1}\right]}, \tag{130}$$

where A = T/(2M+1) is the sampling interval (grid spacing) and the x k = kA are the 
sampling points. Using our standard arguments about the contradiction between the 
band-limited and the physical world [62,33], and invoking the regularization procedure 
discussed in earlier sections, we then construct the following approximate sampling 
formula 

$$f(x) \approx \sum_{k=-W}^{W} f(x_k)\, \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{(2M+1)\sin\left[\frac{\pi}{\Delta}\frac{x - x_k}{2M+1}\right]}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}}, \tag{131}$$

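where $W$, the computational bandwidth, is smaller than $M$. The sampling (grid) points $\{x_k\}$
are distributed around the point of interest, $x$. Obviously, when $\sigma \to \infty$, we recover the
exact sampling theorem Eq. (130). It is also interesting to examine the limit of $M \to \infty$ with a
fixed $\Delta$:

$$\lim_{M\to\infty} \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{(2M+1)\sin\left[\frac{\pi}{\Delta}\frac{x - x_k}{2M+1}\right]}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}} = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{\frac{\pi}{\Delta}(x - x_k)}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}}. \tag{132}$$

We call $\frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{\frac{\pi}{\Delta}(x - x_k)}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}}$ the Shannon-Gabor Wavelet DAF [33] (SGWD), which
is a special case of our previous Lagrange DAF [29,62]. Both the DGWD and the
SGWD are generalizations of the (infinite grid) sinc-DVR [49,63] (which is also the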




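semi-continuous Fourier DAF on a grid [25]). The DGWD reduces to the sinc-DVR in
the simultaneous limits of $M \to \infty$ (with fixed $\Delta$) and $\sigma \to \infty$, and the SGWD reduces to the
sinc-DVR in the limit of $\sigma \to \infty$:

$$\lim_{M\to\infty,\ \sigma\to\infty} \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{(2M+1)\sin\left[\frac{\pi}{\Delta}\frac{x - x_k}{2M+1}\right]}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}} = \lim_{\sigma\to\infty} \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{\frac{\pi}{\Delta}(x - x_k)}\, e^{-\frac{(x - x_k)^2}{2\sigma^2}} = \frac{\sin\left[\frac{\pi}{\Delta}(x - x_k)\right]}{\frac{\pi}{\Delta}(x - x_k)}. \tag{133}$$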
In this sense, our DGWD and SGWD can be regarded as a DAF-windowed sinc-DVR or a
regularized sinc-DVR. Due to the Gabor window, the regularized sinc-DVR matrix is
banded. This endows the regularized sinc-DVRs with great potential for applications in
large scale computations.
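
To make the bandedness concrete, the sketch below builds a second-derivative stencil from a Gaussian-windowed sinc kernel on a uniform grid and applies it as a short convolution. The closed-form grid values were obtained here by differentiating the kernel $\frac{\sin(\pi x/\Delta)}{\pi x/\Delta}\,e^{-x^2/(2\sigma^2)}$ twice; the function name, the band half-width `W`, and the test function are assumptions of this example.

```python
import numpy as np

def sgwd_second_derivative_stencil(delta, sigma, W):
    """Second derivative of the Gaussian-windowed sinc kernel at x = n*delta,
    for n = -W..W. Entries beyond |n| = W are dropped (banded matrix row)."""
    n = np.arange(-W, W + 1)
    c = np.empty(2 * W + 1)
    nz = n != 0
    gauss = np.exp(-(n[nz] * delta) ** 2 / (2.0 * sigma ** 2))
    # Off-diagonal entries: -2 (-1)^n g(n*delta) [1/(n*delta)^2 + 1/sigma^2]
    c[nz] = (-2.0 * (-1.0) ** n[nz] * gauss
             * (1.0 / (n[nz] * delta) ** 2 + 1.0 / sigma ** 2))
    # Diagonal entry: -pi^2/(3 delta^2) - 1/sigma^2
    c[~nz] = -np.pi ** 2 / (3.0 * delta ** 2) - 1.0 / sigma ** 2
    return c

delta, W = 0.25, 24
sigma = 3.0 * delta
stencil = sgwd_second_derivative_stencil(delta, sigma, W)

x = delta * np.arange(-40, 41)
f = np.sin(x)
# The stencil is symmetric, so convolution equals correlation here
d2 = np.convolve(f, stencil, mode="valid")
x_inner = x[W:-W]
max_err = np.max(np.abs(d2 - (-np.sin(x_inner))))
```

Each row of the corresponding matrix has only $2W+1$ non-negligible entries regardless of the total number of grid points, so the cost of applying it grows linearly with the grid size.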

Stark [61] has demonstrated that the Dirichlet sampling theorem can actually be
derived from Shannon's sampling theorem by appropriately imposing the periodic
boundary condition. Therefore, all three sampling theorems are inter-related and both
the Shannon sampling theorem and the Dirichlet sampling theorem are special cases of
the Lagrange sampling theorem.
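
The finite Dirichlet sampling formula can be verified directly. In the sketch below a periodic trigonometric polynomial of degree $M$ is rebuilt from its $2M+1$ equally spaced samples using the kernel $\sin[\frac{\pi}{\Delta}(x - x_k)] / \{(2M+1)\sin[\frac{\pi}{\Delta}\frac{x - x_k}{2M+1}]\}$; the test function and evaluation points are arbitrary illustrative choices.

```python
import numpy as np

def dirichlet_kernel(x, xk, delta, M):
    """Periodic (Dirichlet) sampling kernel for 2M+1 equally spaced points."""
    a = np.pi * (x - xk) / delta
    den = (2 * M + 1) * np.sin(a / (2 * M + 1))
    small = np.abs(den) < 1e-12          # removable singularity, limit is 1
    return np.where(small, 1.0, np.sin(a) / np.where(small, 1.0, den))

M = 3
T = 2.0 * np.pi
delta = T / (2 * M + 1)
xk = delta * np.arange(-M, M + 1)

# Band-limited periodic test function (highest frequency 2*pi*M/T = 3)
f = lambda x: 1.0 + np.cos(2.0 * x) - 0.5 * np.sin(3.0 * x)

x_eval = np.array([-2.9, -1.234, 0.0, 0.5, 2.0, 3.1])
recon = np.array([np.sum(f(xk) * dirichlet_kernel(x, xk, delta, M))
                  for x in x_eval])
max_err = np.max(np.abs(recon - f(x_eval)))
```

Only seven samples are needed here, compared with the infinite set required by the Shannon and Lagrange forms on the whole real line.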

Example 4. Sampling theorems in polar coordinates 

It is not obvious how to generate a sampling theorem in non-Cartesian coordinates.
In polar coordinates, the Fourier-Bessel series plays the role of the Fourier series expansion
in Cartesian coordinates. The notion of a band-limited function is still important, with
band-limited referring to the Fourier-Bessel series expansion of the function, which
vanishes outside a circle of radius $\gamma$. For an isotropic function $f(r)$ band-limited to $\gamma$,
Camo [64] showed that

$$f(r) = \sum_{k=1}^{\infty} f(\alpha_{ik})\, S_{ik}(r), \quad \text{and} \tag{134}$$



where $J_i$ is the $i$th order cylindrical Bessel function of the first kind and $\alpha_{ik}\gamma$ is the $k$th
zero of $J_i(\cdot)$. This result was extended to a real function $f(r,\theta)$ whose $i$th Hankel
transform has compact support:
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M 00 2™ 



/M) = n E M E /(«*, ajj^JftkCrje,.^), and (13 6 ) 

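This form is useful for problems with circular boundary conditions. By our
regularization argument, we propose a corresponding polar coordinate DAF as

$$\delta_{i,M}(r - \alpha_{ik},\, \theta - \theta_n \mid \sigma_{ik}, \sigma) = \delta_i(r - \alpha_{ik} \mid \sigma_{ik})\, \delta_M(\theta - \theta_n \mid \sigma) \tag{138}$$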
2aa J.(r 7 ) sin - 8 n )] _ l£=¥ _ 

This is simply a product of two DAFs. It is noted that, because of the irregular
distribution of $\alpha_{ik}$, $\sigma_{ik}$ is not constant and its appropriate values need to be specified.

DISTRIBUTED APPROXIMATING FUNCTIONAL WAVELETS 

Efficient and general procedures have been prescribed previously for
constructing "delta sequence generated wavelets" from the various delta sequences
described previously. These methods are applicable for generating wavelets from the
various DAFs described in the last section because DAFs are, at least from the point of view
of functional analysis, a special subclass of delta sequences. We shall call the resulting
wavelets "DAF-wavelets", a terminology introduced in Ref. [32], where various DAF-
wavelets were created for the first time by taking the difference of two DAFs. It is clear
that the various methods described in the last section enable one to create as many DAFs
as desired. Moreover, the methods prescribed previously will enable one to generate
infinitely many DAF-wavelets from each DAF. Therefore, we discuss only briefly a few
typical DAF-wavelets in this section. However, the discussion in this section is not a
simple repetition of Section III because DAFs are multiparameter generalizations of the
usual delta sequences. Being a subclass (Schwartz-class) and a multiparameter
generalization of delta sequences facilitates conversion of the DAFs into a wide variety
of DAF-wavelets.

DAF-Wavelets Generated by Differential Pairs

A general expression for various DAF-wavelets produced by our wavelet generators $G_m$ is

$$\Psi_{\rm DAF}(x \mid m, \alpha, \beta, \ldots) = \left(x\frac{d^m}{dx^m} + m\frac{d^{m-1}}{dx^{m-1}}\right)\delta_{\rm DAF}(x \mid \alpha, \beta, \ldots), \qquad m = 1, 2, \ldots \tag{140}$$

The DAF $\delta_{\rm DAF}(x \mid \alpha, \beta, \ldots)$ can be any DAF discussed in Section V. The computation is
straightforward for all DAFs provided that $m$ is not too large. In the case of Hermite
DAFs, Eq. (58), we have

= S (_1) " +m v^2-«n! (^) 

, (141) 



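where $m, M/2 = 0, 1, 2, \ldots$. Here, some simple properties of Hermite functions have been
used to simplify the results. The Mexican hat wavelet, $\frac{1}{\sqrt{2\pi}\,\sigma}\left(1 - \frac{x^2}{\sigma^2}\right)e^{-x^2/(2\sigma^2)}$,
and the Mexican superhat wavelet, $-\frac{1}{\sqrt{2\pi}\,\sigma^3}\left(3 - \frac{6x^2}{\sigma^2} + \frac{x^4}{\sigma^4}\right)e^{-x^2/(2\sigma^2)}$, are given by
$\Psi_{0,1}(x \mid \sigma)$ and $\Psi_{0,3}(x \mid \sigma)$ respectively. In general, the series of Gauss-delta-sequence-
generated wavelets, Eq. (47), are given as a special case of $\Psi_{0,m}(x \mid \sigma)$, $m = 0, 1, 2, \ldots$.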
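
The action of a differential wavelet generator on the simplest DAF (a normalized Gaussian, i.e. the $M=0$ case) can be sketched as follows. The generator is assumed here to have the form $x\,d^m/dx^m + m\,d^{m-1}/dx^{m-1}$, which is the same as $d^m/dx^m$ applied to $x\,\delta(x)$; the function names and normalization are choices made for this example.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermeval

def gaussian_derivative(x, sigma, n):
    """n-th x-derivative of g(x) = exp(-x^2/(2 sigma^2)) / (sqrt(2 pi) sigma),
    using d^n/du^n exp(-u^2/2) = (-1)^n He_n(u) exp(-u^2/2) with u = x/sigma."""
    u = x / sigma
    g = np.exp(-0.5 * u ** 2) / (np.sqrt(2.0 * np.pi) * sigma)
    coeffs = np.zeros(n + 1)
    coeffs[n] = 1.0                      # select the polynomial He_n
    return (-1.0 / sigma) ** n * hermeval(u, coeffs) * g

def gauss_wavelet(x, sigma, m):
    """Assumed generator (x d^m/dx^m + m d^(m-1)/dx^(m-1)) applied to g, m >= 1."""
    return (x * gaussian_derivative(x, sigma, m)
            + m * gaussian_derivative(x, sigma, m - 1))

sigma = 1.5
x = np.linspace(-12.0, 12.0, 4801)
dx = x[1] - x[0]
g = np.exp(-x ** 2 / (2.0 * sigma ** 2)) / (np.sqrt(2.0 * np.pi) * sigma)

mexican_hat = gauss_wavelet(x, sigma, 1)   # proportional to (1 - x^2/sigma^2) g
superhat = gauss_wavelet(x, sigma, 3)      # quartic polynomial times g
zero_mean = np.sum(mexican_hat) * dx       # wavelets should integrate to zero
```

Since the generator is a total derivative of $x\,\delta(x)$, every wavelet produced this way integrates to zero whenever $x\,\delta(x)$ and its derivatives vanish at infinity.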

Since all of our nonorthogonal DAFs have the structure Tw, Eq. (140) can be 
written also as 

m 1 m— 1 / <t \ 1 

^*!(m-*)! ^i!( m _f_i)i 

(142) 

for nonorthogonal DAF wavelets. This form may be useful for the implementation of Eq. 
(140). 

The moments $M_k$ of DAF-wavelets generated using our wavelet generators $G_m$ can
be calculated as
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f k ( d m d m ~ l \ 



k+l< m ' 

^r^S' l -^s DiFix{a ^,.. Wk+ ^ m< 4 _ m+leven (143) 

* + 1 > m, k - m + 1 odd 
This expression is modified by a constant if the DAF-wavelets are normalized in L 2 (R). 

DAF Wavelets Generated by Difference Pairs 

A second class of DAF-wavelets is generated, in general, by

$$\Psi_{\rm DAF}(x \mid \alpha, \beta, \ldots;\ \alpha', \beta', \ldots) = \delta_{\rm DAF}(x \mid \alpha, \beta, \ldots) - \delta_{\rm DAF}(x \mid \alpha', \beta', \ldots), \tag{144}$$

where at least one comparable pair of parameters, say $\alpha$, $\alpha'$, are different from each other.
For the Hermite DAF, we have 

W I - i«P (£) J ;j {^j {£) (1*> 

The Mexican hat wavelet is obviously a special case of this general expression,
specifically $\Psi_{\rm DAF}(x \mid 2, \sigma;\ 0, \sigma)$.
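
The difference construction can be illustrated with the Hermite DAF. The sketch below assumes the commonly used form $\delta_M(x \mid \sigma) = \frac{1}{\sigma} e^{-x^2/(2\sigma^2)} \sum_{n=0}^{M/2} \left(-\frac{1}{4}\right)^n \frac{1}{\sqrt{2\pi}\, n!} H_{2n}\!\left(\frac{x}{\sqrt{2}\sigma}\right)$ for Eq. (58), which is not reproduced in this section; the function names are hypothetical.

```python
import numpy as np
from math import factorial
from numpy.polynomial.hermite import hermval

def hermite_daf(x, M, sigma):
    """Hermite DAF of even order M (assumed form of Eq. (58))."""
    y = x / (np.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        # coefficient multiplying the physicists' Hermite polynomial H_{2n}
        coeffs[2 * n] = (-0.25) ** n / (np.sqrt(2.0 * np.pi) * factorial(n))
    return np.exp(-y ** 2) * hermval(y, coeffs) / sigma

def difference_wavelet(x, M1, sigma1, M2, sigma2):
    """Difference of two DAFs that differ in at least one parameter."""
    return hermite_daf(x, M1, sigma1) - hermite_daf(x, M2, sigma2)

sigma = 1.0
x = np.linspace(-10.0, 10.0, 2001)
dx = x[1] - x[0]

psi = difference_wavelet(x, 2, sigma, 0, sigma)
mexican_hat = ((1.0 - x ** 2 / sigma ** 2)
               * np.exp(-x ** 2 / (2.0 * sigma ** 2))
               / (2.0 * np.sqrt(2.0 * np.pi) * sigma))
daf_area = np.sum(hermite_daf(x, 8, sigma)) * dx
```

Because each Hermite DAF integrates to one, the difference of any two of them integrates to zero, which is the basic admissibility requirement for a wavelet.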

It is straightforward to generate wavelets from the Hermite DAFs of the Fejer
type, Eqs. (66) and (67). Some simple examples regarding our Dirichlet-Gabor wavelet-
packet DAF and Shannon-Gabor wavelet DAF are given in Refs. [32] and [33]
respectively. Since DAFs already have been found to be extremely powerful for a variety
of numerical applications, we expect that DAF-wavelets will play an important role in
all of those areas where wavelet techniques are applicable. This is currently under
investigation.



CONCLUSIONS 

The general connection between wavelets and delta sequences (the sequences of 
functions which converge to the delta distribution) has been spelled out in some detail.




Qualitatively, delta sequences are father wavelets (scale functions). If a delta sequence 
is an orthogonal system, it is found to span the wavelet subspace V 0 in a multiresolution 
analysis. Various delta sequences arising in mathematical, physical and engineering 
applications are reviewed. 

A set of wavelet generators is constructed for converting delta sequences into 
mother wavelets. These generators are connected with an infinite dimensional Lie 
algebra which has an extremely simple algebraic structure and includes the algebra of 
translation and dilation operations as an invariant subalgebra. The corresponding Lie 
group provides the basis for a mathematical description of wavelets, which is more 
general than the usual translation and dilation group. A new set of orthogonal wavelets 
is found in the case of Dirichlet's continuous delta sequence. The well-known Mexican 
hat wavelet has been shown to be a special case of a variety of the Hermite wavelets, and 
has been derived by two distinct approaches. 

The general connection between wavelet bases and frames and conventional 
L 2 (a,6) polynomial bases was briefly discussed. Essentially, the father wavelet 
corresponds to the lowest order polynomial and all higher order polynomials are related 
to the mother wavelets, provided that the polynomials are orthogonal with respect to 
some weight. 

Distributed approximating functionals (DAFs) were defined as multiparameter
delta sequences of the Dirichlet type, constructed using Schwartz-class functions. DAFs 
were classified as orthogonal and nonorthogonal. The former are constructed by 
orthogonal basis expansion of the delta distribution, and the latter are constructed by the 
method of regularization. Both orthogonal and nonorthogonal DAFs are frames. The 
construction of orthogonal DAFs is briefly reviewed and more details can be found in 
Ref. [28]. The construction of nonorthogonal DAFs is described in terms of Fourier
space regularization. A general and efficient procedure for generating interpolating 
DAFs is presented. The connection between the DAF approach and mathematical 
sampling theory is discussed in detail. Various examples are given to illustrate our 
approaches. Clearly, there are infinitely many more DAFs which can be easily
constructed using our approach. For example, the well known formula $\prod_{k=1}^{\infty}\cos(2^{-k}x)$ can
be used to generate a DAF. It is not possible in this paper to enumerate all the various 
possibilities. 
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A method of creating arbitrarily smoothed and arbitrarily shaped window 
functions is briefly discussed, based on regularization. Smooth low pass, high pass, band 
pass and band stop filters are constructed as special cases. The desired degree of 
smoothness is attainable by using an appropriate ($S$ or $C^m$) regularizing function.

A variety of DAF-wavelets (wavelets generated by using DAFs) is constructed
by using either our wavelet generators or the difference method. The Hermite DAF is 
used to illustrate our approach because in that case, analytical forms can be obtained 
easily. The Mexican hat wavelet is identified as a special case of the Hermite-DAF 
wavelets. We expect that various DAF wavelets will play an important role in a wide 
variety of numerical applications. 

All references (articles and patents) referenced or cited in this disclosure are 
incorporated herein by reference. Although the invention has been disclosed with 
reference to its preferred embodiments, from reading this description those of skill in the 
art may appreciate changes and modifications that may be made which do not depart from
the scope and spirit of the invention as described above and claimed hereafter. 
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CLAIMS 

We Claim: 

1. A method for data padding and noise filtering implemented on a digital 
processing device, comprising the steps of:

a. defining a total data set as the collection of all known and unknown data, 
where the unknown values may be interspersed among the known values, 
or concentrated in a region adjacent to the known values, or a 
combination of the two; 

b. placing the unknown values of the data to be obtained by a padding 
procedure so that the total data set contains only equally spaced data, i.e., the
data has a constant sampling interval and the total data set represents a
function;

c. calculating an appropriate difference between the true data and an 
approximation to the data for all known and unknown data values; 

d. minimizing the difference with respect to the unknown data values by 
iteration or using the calculus of variations to obtain algebraic equations 
for the unknown values or solving the algebraic equation for the unknown 
data values. 
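
One way the padding steps a to d above might be realized is as a linear least-squares problem: the unknown values are chosen so that the total data set differs as little as possible from its own DAF approximation. The sketch below is only an illustration under stated assumptions: a non-interpolating Hermite DAF (in the commonly used form given in the docstring) serves as the approximation, the squared $\ell_2$ norm is taken as the "appropriate difference", and the parameter values and function names are hypothetical.

```python
import numpy as np
from math import factorial
from numpy.polynomial.hermite import hermval

def hermite_daf(x, M, sigma):
    """Assumed Hermite DAF kernel:
    (1/sigma) exp(-y^2) sum_n (-1/4)^n H_2n(y) / (sqrt(2 pi) n!), y = x/(sqrt(2) sigma)."""
    y = x / (np.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (np.sqrt(2.0 * np.pi) * factorial(n))
    return np.exp(-y ** 2) * hermval(y, coeffs) / sigma

def pad_unknowns(values, known_mask, delta, M=20, sigma_over_delta=3.0):
    """Fill entries where known_mask is False by minimizing ||f - A f||^2,
    where A is the DAF approximation matrix on the equally spaced grid."""
    n = values.size
    x = delta * np.arange(n)
    A = delta * hermite_daf(x[:, None] - x[None, :], M, sigma_over_delta * delta)
    B = np.eye(n) - A                       # residual operator f - A f
    unknown = ~known_mask
    rhs = -B[:, known_mask] @ values[known_mask]
    # Normal equations of the variational problem, solved by least squares
    sol, *_ = np.linalg.lstsq(B[:, unknown], rhs, rcond=None)
    filled = values.copy()
    filled[unknown] = sol
    return filled

delta = 0.2
x = delta * np.arange(100)
truth = np.sin(x)
known = np.ones(100, dtype=bool)
known[45:50] = False                        # a gap of five unknown values
data = truth.copy()
data[~known] = 0.0                          # placeholders for unknown values
filled = pad_unknowns(data, known, delta)
gap_error = np.max(np.abs(filled - truth))
```

In this sketch the gap lies well inside the data, so edge effects of the truncated DAF sum do not couple to the unknown values; gaps adjacent to the ends of the known data would need the extension or tail-function devices described in the dependent claims.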

2. The method of claim 1, further comprising the step of: 

e. removing a noise from the total data set using a non-interpolating, well- 
tempered distributed approximating functional (NIDAF)-low-band-pass 
filter to form an approximation to the total data set. 
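
A noise-removal step of the kind recited in step e can be sketched by applying a non-interpolating DAF as a smooth low-pass filter. As before, the Hermite DAF form, the parameters $M$ and $\sigma/\Delta$, the noise level, and the function names are assumptions chosen for illustration only.

```python
import numpy as np
from math import factorial
from numpy.polynomial.hermite import hermval

def hermite_daf(x, M, sigma):
    """Assumed Hermite DAF kernel (non-interpolating, well-tempered for suitable M, sigma)."""
    y = x / (np.sqrt(2.0) * sigma)
    coeffs = np.zeros(M + 1)
    for n in range(M // 2 + 1):
        coeffs[2 * n] = (-0.25) ** n / (np.sqrt(2.0 * np.pi) * factorial(n))
    return np.exp(-y ** 2) * hermval(y, coeffs) / sigma

def nidaf_lowpass(values, delta, M=8, sigma_over_delta=6.0):
    """Approximate the data by its DAF average, which attenuates high frequencies."""
    x = delta * np.arange(values.size)
    A = delta * hermite_daf(x[:, None] - x[None, :], M, sigma_over_delta * delta)
    return A @ values

rng = np.random.default_rng(1)
delta, n = 0.2, 400
x = delta * np.arange(n)
clean = np.sin(x)
noise = 0.1 * rng.standard_normal(n)
smooth = nidaf_lowpass(clean + noise, delta)

inner = slice(60, -60)                      # ignore edge effects of the finite sum
rms_before = np.sqrt(np.mean(noise[inner] ** 2))
rms_after = np.sqrt(np.mean((smooth - clean)[inner] ** 2))
```

A smaller $M$ or a larger $\sigma/\Delta$ narrows the pass band and removes more noise, at the cost of distorting more of the signal, which is the trade-off the filter parameters are meant to control.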

3. The method of claim 1, wherein the data is a time-sequence of multi-dimensional
data.

4. The method of claim 2, wherein the time-sequence is a 2-dimensional sampling of
an image at a fixed time interval.

5. The method of claim 1, wherein the data is 1-dimensional data.
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6. A method for data padding and noise filtering implemented on a digital 
processing device, comprising the steps of: 

a. defining a total data set as the collection of all known and unknown data, 
where the unknown values may be interspersed among the known values, 
or concentrated in a region adjacent to the known values, or a 
combination of the two; 

b. placing the unknown values of the data to be obtained by a padding 
procedure so that the total data set contains only equally spaced data, i.e., the
data has a constant sampling interval and the total data set represents a
function;

c. calculating an appropriate difference between the true data of the total 
data set and a DAF approximation to the data for all known and unknown
data values; and 

d. minimizing the difference with respect to the unknown data values by 
iteration or by using the calculus of variations to obtain algebraic 
equations for the unknown values or solving the algebraic equation for the 
unknown data values. 

7. The method of claim 6, further comprising: 

b. when the known input data set comprises a finite number of sequential 
data values having an evenly spaced distribution, adding a gap of a finite 
number of unknown data having the same spacing or sampling rate and 
some other functional dependence or tail function; 

c. calculating an appropriate difference between the true data (known and 
unknown, including gap points and tail function points) and a DAF 
approximation to the data values including values at gap points and the 
unknown coefficients in the tail function or sum of basis functions and 
choosing DAF parameters so that the largest predicted or estimated data
values do not exceed the largest known input values and the lowest
estimated or predicted data values are not less than the lowest known
input values; and

d. minimizing this difference with respect to all unknown values and
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coefficients by iteration or by using the calculus of variations to obtain 
algebraic equations for the unknowns or solving the algebraic equations 
for the unknowns. 

8. The method of claim 6, further comprising: 

b. when the known data set comprises a finite number of evenly spaced,
sequential data points, adding a gap with a finite number of data points
having the same spacing or sampling rate and adding a repetition of the
finite number of evenly spaced, sequential data points to give a periodic
extension of the known input data, where a sign associated with the
periodic extension is the same or opposite of the input data and where the
same sign periodic extension yields a symmetric extension of known
input data and the opposite sign periodic extension yields an anti-
symmetric extension of known input data;

c. calculating an appropriate difference between the true data (known and 
unknown, including gap points) and a DAF approximation to this data 
and choosing DAF parameters so the largest estimated or predicted data 
do not exceed the largest known input data and the lowest estimated or 
predicted data are not less than the lowest known input data; and 

d. minimizing this difference with respect to the unknown data values by 
iteration or using the calculus of variations to obtain algebraic equations 
for the unknown values or solving the algebraic equations for the 
unknown gap data values. 

9. The method of claim 2, further comprising: 

b. when the known input data set comprises a finite number of evenly
spaced (or sampled), but sequential data points, adding interspersed
unknown data points to construct an evenly spaced or sampled data set to
form an augmented evenly spaced sequential data set, adding a gap with
unknown data having the same spacing or sampling rate and then
repeating the augmented evenly spaced sequential data set after the gap
to form a periodic, evenly spaced extension of the known data where the




extension has the same or opposite sign of the augmented data set where
a same sign extension yields a symmetric periodic extended data set and 
an opposite sign yields an anti-symmetric periodic extended data set; 

c. calculating an appropriate difference between the true data (known and 
unknown, including gap and interspersed unknown data) and a DAF 
approximation to the total data set and choosing DAF parameters so the 
largest estimated or predicted data do not exceed the largest known input 
data, and so that the lowest estimated or predicted data are not less than 
the lowest known input data; 

d. minimizing this difference with respect to the unknown data values and
gap and interspersed unknown data by iteration or using the calculus of 
variations to obtain algebraic equations for the unknown values or solving 
the algebraic equations for the unknowns; 

10. A method for image processing implemented on a digital processing device, 
comprising the steps of: 

associating data of an image with a grid of points;

associating a measure of disorder parameter with the data of the image; 

padding the data and periodically extending it as in claims 4 or 5, using a DAF 
approximation to the data with the DAF parameters chosen so that the largest estimated 
or predicted data values do not exceed the largest known input data and the lowest 
estimated or predicted data values are not less than the lowest known input data;

filtering out the noise by using a well-tempered NIDAF-low-band-pass-filter to
approximate the total data set and calculating the disorder parameter as a function of the
DAF parameters $M$ and $\sigma/\Delta$. The change in the disorder parameter as a function of $\sigma/\Delta$
(for a given $M$) will be used as the indicator for the optimum well-tempered NIDAF-low-
band-pass-filter;

if desired, one can also use the periodically extended and filtered total data set to 
carry out further standard Fourier filtering. 

11. The method of claim 10, further comprising: 

categorizing the images and patterns using the final disorder parameter; 
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images or patterns with the same disorder parameter will be assumed to be 

similar; 

a catalogue of such images or patterns, and their corresponding disorder 
parameters, will be used to construct a comparison and categorization of surface 
roughness, as well as other properties that generate images and patterns; 

similar catalogues of such images and/or patterns of stressed materials will be 
used to identify incipient structural flaws or weakness; 

similar catalogues of fractured or on-going fracturing materials will be used to 
identify microscopic fractures or structural flaws; 

similar catalogues of patterns associated with any kind of defect will be
constructed and used for predictive diagnostics. 

12. A method for data padding and noise filtering implemented on a digital 
processing device, comprising the steps of: 

a. defining a total data set as the collection of all known and unknown data, 
where the unknown values may be interspersed among the known values, 
or concentrated in a region adjacent to the known values, or a 
combination of the two; 

b. placing the unknown values of the data to be obtained by a padding 
procedure so that the total data set contains only equally spaced data, i.e., the
data has a constant sampling interval and the total data set represents a
function;

c. calculating an appropriate difference between the true data of the total 
data set and a DAF approximation to the data for all known and unknown
data values; 

d. solving the algebraic equation for the unknown data values. 

13. The method of claim 1, further comprising the step of: 

e. removing a noise from the total data set using a non-interpolating, well- 
tempered distributed approximating functional (NIDAF)-low-band-pass 
filter to form an approximation to the total data set.

Fig. 1 The Hermite DAF (a) in coordinate space, (b) in frequency space, (c) the first order derivative in
coordinate space. The solid line is for $\sigma=3.54$, $M=120$ and the dashed line is for $\sigma=2.36$, $M=30$
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Fig.2 Original Lena image (240x240). 



Fig.3 (a) The degraded Lena image, PSNR=22.14dB. (b) Restored Lena image, PSNR=30.14dB
Fig.4 (a) The degraded Lena image, PSNR=18.76dB. (b) Restored Lena image, PSNR=28.19dB
Fig.1 Hermite DAF ($M=8$, $\sigma=1$)




PR interval QT interval 

Fig.2 ECG Criterion Characteristics for Diagnosis 




(a) (b) (c)

Fig.3 ECG filtering (a) Original ECG (b) Low pass filtering (c) DAF wavelet net filtering




(a) (b) (c) 

Fig.4 EMG filtering (a) Original EMG (b) Low pass filtering (c) DAF wavelet net filtering 
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Fig. 1 Image restoration from 60% impulse noise, (a) Corrupted image, (b) Filtering by Sun and 
Neuvo's median switch scheme, (c) Our filtering, (d) Our modified filtering.
Fig. 2 Image restoration from 40% impulse noise, (a) Corrupted image, (b) Median filtering (3x3),
(c) Median filtering (5x5), (d) Our filtering.
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PR interval QT interval 

Fig.3 ECG Criterion Characteristics for Diagnosis




(a) (b) (c)

Fig.4 ECG Filtering, (a) Original ECG (b) low-pass filtering (c) our filtering 
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(a) (b) 
Fig. 1. $\pi$ band-limited interpolating wavelets
(a) Sinc function (b) Sinclet wavelet




Fig. 2. Interpolating Cardinal Spline ($D=5$)




(a) (b) 
Fig. 3. Interpolating wavelets by auto-correlation shell (D=3) 
(a) Daubechies wavelet (b) Dubuc wavelet 




Fig. 4. Lifting scheme 
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(a) (b) 




(c) (d) 
Fig. 5. Lagrange Wavelets with D=3 
(a) Scaling (b) Wavelet (c) Dual scaling (d) Dual wavelet 




(a) (b) 
Fig. 6. Frequency Response of Equivalent Filters ($D=3$)
(a) Decomposition (b) Reconstruction 
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Fig. 8. Frequency Response of Equivalent Filters (D=9) 
(a) Decomposition (b) Reconstruction 
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(a) 




(b) 
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W (e) 
Fig. 9. Non-regularized Lagrange Wavelets (M=5) 
(a) Lagrange polynomial (b) Scaling (c) Wavelet 

(d) Dual scaling (e) Dual wavelet
Fig. 11. Frequency Response of Equivalent Filters ($N=4$, $\eta=2$)
(a) Decomposition (b) Reconstruction
Fig. 12. Mother Wavelet Comparison ($N=4$, $\eta=2$)
Solid: B-spline Lagrange; dotted: Gaussian Lagrange




(a) (b) 
Fig. 13. Nonlinear Masking Functional 
(a) Donoho Hard Logic Nonlinearity (b) Softer Logic Nonlinearity
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(b) 
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(c) 

Fig. 15. VGN image processing for Lena 
(a) Noisy Lena (b) Median filtering result (c) our method
(a) (b)
Fig. 1. $\pi$ band-limited interpolating wavelets
(a) Sinc function (b) Sinclet wavelet




Fig.2. Interpolating Cardinal Spline ($D=5$)





Fig. 4. Lifting scheme 
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(c) 



(d) 



Fig.5. Lagrange Wavelets with $D=3$
(a) Scaling function (b) Wavelet (c) Dual scaling function (d) Dual wavelet 





Fig.6. Frequency Response of Equivalent Filters ($D=3$)
(a) Decomposition (b) Reconstruction
Fig.8. B-Spline Lagrange DAF Wavelets ($N=4$, $\eta=2$)

(a) Scaling function (b) Wavelet (c) Dual scaling function (d) Dual wavelet 





(a) 



(b) 



Fig.9. Frequency Response of Equivalent Filters ($N=4$, $\eta=2$)
(a) Decomposition (b) Reconstruction 




Fig. 10. Mother wavelet comparison ($N=4$, $\eta=2$)
Solid: B-spline Lagrange; dotted: Gaussian-Lagrange




Fig. 11. Gibbs overshoot of the Sinc FIR
(a) (b)
Fig. 16 Dubuc wavelets ($D=3$)
(a) Inner scaling (b) Boundary scaling
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(a) (b) 
Fig. 17 Generalized Sinc wavelets ($N=2$, $\eta=2$)
(a) Inner scaling (b) Boundary scaling 




(a) (b) 
Fig. 18 Frequency response comparison of boundary filters
(a) Halfband Lagrange wavelet (b) Sinc-DAF wavelet 





(a) (b) 

Fig. 19 Target extraction from color background
(a) Original pilot view (b) DAF-wavelet restoration
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(a) (b) 
Fig.20. Visual Color Image Restoration 
(a) Noisy girl (b) Our restoration
(a) (b)
Fig. 1. $\pi$ band-limited interpolating wavelets
(a) Sinc function (b) Sinclet wavelet




Fig.2. Fourier Gibbs overshoot of Sinc FIR implementation
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(c) (d) 
Fig. 3. Sinc Cutoff Wavelets ($M=9$)
(a) Scaling (b) Wavelet (c) Dual scaling (d) Dual wavelet 




(c) (d) 
Fig.4. B-Spline Lagrange DAF Wavelets ($N=5$, $\eta=3$)
(a) Scaling (b) Wavelet (c) Dual scaling (d) Dual wavelet 
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(a) (b) 
Fig.5. Frequency Response of Equivalent Filters ($M=5$, $\eta=3$)
(a) Decomposition (b) Reconstruction 




Fig.6 Mother Wavelet Comparison ($N=4$, $\eta=2$)
Solid: B-spline Sinc; dotted: Gaussian Sinc
Fig.7 Nonlinear Masking Functional
(a) Donoho Hard Logic Nonlinearity (b) Softer Logic Nonlinearity







(b) 

Fig.8. Mammogram enhancement 
(a) Original mammogram (b) Multiresolution enhancement by 
DAF-wavelet 
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Fig. 1. Cube model of RGB color 
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Fig.2. Alternative representation 
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Fig.3. Hexagon projection of color tube 
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(a) (b) (c) 

Fig. 1 traditional trivial window 
(a) 1 x 1 window (single pixel), (b) 3x3 window, (c) 5x5 window. 




Fig.2 Arbitrary quincunx extension of symmetric neighboring basket 
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Fig.3 Quincunx basket selection for the filtering 




(a) (b) 
Fig.4. Image restoration from 40% impulse noise, (a) Corrupted image, (b) Our filtering.
(a) (b) (c)

Fig.5. Wavelet noise removal
(a) Gaussian-degraded Lena (b) DAF wavelet thresholding (c) DAF wavelet + quincunx filtering
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